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Abstract 


We  explore  the  following  matrix  problem:  Given  an  n  x  n  boolean  matrix,  is  there  a  permu¬ 
tation  of  the  rows  and  a  permutation  of  the  columns  such  that  the  resulting  matrix  is  lower 
triangular?  We  show  the  relationship  of  this  matrix  problem  to  the  two  important  schedul¬ 
ing  problems:  optimization  of  code  for  pipelined  execution  and  microcode  compaction  for 
very  long  instruction  computers. 

This  matrix  problem  is  unclassified — it  is  unknown  whether  it  is  iVP-Complete  or 
whether  it  can  be  solved  by  a  polynomial  time  algorithm.  We  find  several  minor  exten- 
sions  that  would  make  the  problem  jNTP- Complete.  Also,  we  show  polynomial  algorithms 
for  a  number  of  special  cases  of  the  problem,  and  develop  a  number  of  interesting  techniques 
in  the  process.  We  also  explore  approximation  algorithms  and  lower  bounds. 


iii 


Acknowledgements 


Foremost,  I  would  like  to  thank  my  advisor  Donald  Knuth.  He  started  out  by  saving  me  from 
the  sirens  of  Artificial  Intelligence.  Then  he  provided  me  with  a  fertile  environment  in  which 
to  explore  many  problems.  Discussions  with  him  proved  stimulating  as  he  indefatigably 
poked  holes  in  my  proofs,  gave  me  new  ideas  and  guidance,  and  recalled  the  just-right 
obscure  reference  in  the  literature.  Also,  while  I  will  never  acquire  his  ability  for  precision 
and  clarity  in  thought  and  writing,  I  will  be  better  off  from  having  seen  his  example  and 
from  being  pushed  into  trying. 

Besides  the  constructive  comments  he  made  on  this  thesis,  Ernst  Mayr  was  the  pro¬ 
fessor  at  Stanford  whose  numerous  courses  and  seminars  provided  the  greatest  service  and 
inspiration  to  young  algorithms  students  of  my  class.  At  least  we  students  at  Stanford  were 
very  appreciative. 

I  also  would  like  to  thank  my  third  reader  Andrew  Goldberg.  I  only  wish  that  he  had 
come  to  Stanford  earlier  in  my  career  so  that  I  would  have  had  more  opportunities  to  learn 
from  him. 

The  papers  that  I  co-authored  while  at  Stanford  provided  another  great  source  of  learn¬ 
ing  and  enjoyment  at  Stanford.  I  would  like  to  thank  all  my  co-authors  for  allowing  me 
to  enter  their  domains  and  for  tolerating  my  leaden  pace  of  writing:  Evan  Cohn,  Anton 
Dahbura,  Joan  Feigenbaum,  Jeff  Naughton,  Shaibal  Roy,  Alejandro  Schaffer. 

Numerous  other  friends  also  helped  make  the  many  years  go  by  faster:  Elizabeth  Buss, 
Helen  Davis,  Marcia  Derr,  Scott  Gimpel,  Keith  Hall,  Peter  Karp,  Andrew  Kosoresow,  Tracy 
Larrabee,  Sherry  Listgarten,  Oren  Patashnik,  Karen  Peiper,  Evan  Reid,  and  others. 

I  am  also  indebted  to  my  colleagues  at  the  Western  Research  Lab  of  the  Digital  Equip¬ 
ment  Corporation,  who  actively  encouraged  me  during  the  final  writing  stretch.  Equally 
important,  they  patiently  tolerated  bugs  in  my  computer  programs. 

I  received  financial  support  from  a  variety  of  sources.  My  first  year  at  Stanford  was 


iv 


funded  by  a  General  Electric  Foundation  FeUowship.  Subsequent  years  were  funded  by 
grants  from  DARPA,  NSF  and  ONR.  Much  of  the  write-up  of  the  thesis  was  supported  by 
DEC  Western  Research  Lab. 

Next,  I’d  like  to  thank  David  Robbins.  He  is  the  high  school  math  professor  who  first 
took  me  aside  and  suggested  that  there  was  more  to  computers  than  computer  games.  He 
directed  me  to  a  multi- volume  set  of  books  in  the  library.  Local  zoning  limited  buildings  to 
four  fioors,  so  they  numbered  the  floors  of  the  library:  ground  floor,  mezzanine,  first  floor, 
first  mezzanine,  second  floor,  second  mezzanine,  third  floor,  third  mezzanine,  fourth  floor. 
I  can  remember  it  clearly  now:  there  I  was  sitting  in  the  mammoth  library  of  one  of  the 
best  prep  schools  in  the  country,  wondering,  “Why  are  they  so  cheap  that  they  only  bought 
the  first  three  volumes?” 

Lastly,  my  mother  and  father  have  always  emphasized  the  importance  of  education  and 
have  made  numerous  sacrifices  to  ensure  that  I  had  the  best.  I  will  always  be  greatly 
indebted  to  them. 


Contents 


Abstract 

• 

Acknowledgements 

1  Introduction  ^ 

1.1  Probing  the  Innards .  1 

2  Basics  and  Warmups  2 

2.1  The  Basic  Problem .  ^ 

2.1.1  Viewed  as  a  Graph  Problem .  3 

2.1.2  Implied  Orderings .  7 

2.2  More  Basics . .  •  • 

2.3  Variants . 

2.3.1  Non-unit  execution  times .  16 

2.3.2  Allowing  x  ^  ®  Constraints .  18 

2.3.3  All  Together  .  20 

2.3.4  Contractor  Cash-Flow .  21 

2.4  Allowing  y  —*  x  Constraints .  21 

2.4.1  Microcode  Compaction  for  Very  Long  Instruction  Word  Computers  .  22 

2.5  Pipeline  Scheduling .  26 

2.6  Precedence  Constraints  .  83 

3  Well-Ordered  Schedules  8^ 

3.1  Definitions .  84 

3.1.1  Charts .  85 

3.2  Two  Is  Per  Row .  87 

vi 


3.2.1  Component  Types . 

3.2.2  Ordering  the  Components .  39 

3.2.3  Transposition  of  a  Chart . 

3.2.4  Indivisible  Components? .  ^^3 

3.3  Merging . 

3.3.1  Peak  Balance .  ^9 

3.3.2  Basic  Lemmas  .  ^9 

3.3.3  Segregation . 

3.3.4  Decompose . 

3.3.5  APD- Segregated  Schedule .  56 

3.3.6  Well-Ordered  Optimum .  59 

3.3.7  Fundamental  Charts . . .  53 

3.3.8  Nearly  Well-Ordered  Optimum  Schedules .  55 

3.3.9  Merge .  58 

3.4  Weighted  Chains .  ^9 

4  One,  Two,  Three,  Approximation  "^4 

4.1  Nodes  with  degree  .  . .  ^4 

4.1.1  Scheduling  trees  in  polynomial  time .  74 

4.2  Nodes  with  degree  .  . .  75 

4.2.1  Collapsing  Transformation .  75 

4.3  Nodes  with  degree  <3 .  78 

4.3.1  Expanding  Transformation .  78 

4.3.2  Three  ones  per  row .  79 

4.4  Polynomial  Solutions  to  Overhang-Limited  PLATEAUs .  80 

4.4.1  3-Regular  Graphs .  83 

4.5  Approximations .  84 

5  Lower  Bound  Techniques  ®9 

5.1  Simple  Methods .  90 

5.2  Permanents  and  Perfect  Matchings . .  •  •  93 

5.2.1  Subarrays .  95 

5.3  Network  Flow .  97 

5.4  Minimal  Graphs  .  193 

vii 


5.4.1  Simple  Minimals  .  .  . 

5.4.2  Disconnected  Minimals 


103 

106 


6  Games 

6.1  A  Game  Theory  Problem . 

6.2  Misere  Play . 

6.2.1  A  Misere  Hackenbush  variant  . 

6.3  Normal  Play  . 


109 

109 

110 
110 
111 


7  Conclusions 


113 


Bibliography 


116 


viii 


Chapter  1 


Introduction 


1.1  Probing  the  Innards 

The  triangularization  problem  that  this  thesis  focuses  on  is  deceptively  simple  to  state: 

Given  an  n  X  n  boolean  matrix,  is  there  a  permutation  of  the  rows  and  a  per¬ 
mutation  of  the  columns  such  that  the  resulting  matrix  is  lower  triangular? 

This  simplicity  of  definition  is  one  of  the  things  that  makes  this  problem  interesting. 
Another  is  that  the  problem  is  elusive:  No  one  has  been  able  to  show  either  that  the  problem 
is  in  P  or  that  the  problem  is  ATP-Complete.  For  the  theoretically  inclined,  these  two  things 
are  enough  to  warrant  further  investigation:  simple,  yet  elusive.  There  must  be  something 
interesting  going  on  here. 

The  practical  minded  may  not  be  so  easily  intrigued.  Yet,  there  is  something  here  for 
them,  too.  For  the  problem  can  be  viewed  as  part  of  a  very  fundamental  and  useful  class: 
the  class  of  scheduling  problems.  The  tools  developed  as  we  dissect  and  probe  the  innards 
of  the  triangularization  problem  will  surely  be  useful  in  other  explorations. 
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Chapter  2 


Basics  and  Warmups 


2.1  The  Basic  Problem 

We  will  focus  first  on  a  way  to  formulate  the  triangularization  problem  as  a  scheduling 
problem.  A  comprehensive  recent  overview  of  work  on  other  scheduling  problems  can  be 
found  in  [LLRS85]. 

We  have  two  processors,  px  *•^<1  PY^  jobs,  X  = 

I'  =  {j/i.  •  •  ".yn}-  We  want  to  execute  the  jobs  in  the  minimum  amount  of  time.  There  are 

restrictions  that  we  must  obey: 

•  First,  the  X  jobs  must  be  executed  on  processor  px  and  the  Y  jobs  must  be  executed 
on  processor  py.  There  are  two  ways  to  view  this.  We  can  view  the  two  processors  as 
being  different;  for  example,  one  processor  can  perform  only  floating  point  operations 
and  one  can  perform  only  integer  operations.  Alternately,  we  can  consider  the  jobs  as 
being  pre- assigned  to  processors. 

•  Second,  each  job  takes  one  unit  of  time  to  execute.  Whether  this  unit  is  a  minute  or  a 
micro-second  is  irrelevant — just  so  long  as  all  the  jobs  take  the  same  amount  of  time. 

•  Third,  the  jobs  are  non-pre-emptive.  That  is,  once  a  job  is  started,  it  runs  to  comple¬ 
tion;  the  job  can  not  be  halted  and  then  restarted. 

•  Fourth,  there  exist  Xi  — »  yj  constraints  between  jobs.  The  constraint  x\  — »  ya  means 
that  job  xi  must  be  completed  before  job  yz  can  be  started. 
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X  =  {a:i,®2)®3)®4} 

y  =  {yi,y2,y3,i/4} 

®1  —*■  yi)  ®2  !/l>  ®3  — y2i  ®3  — ^  2/3)  ®3  — >  2/4)  l4  — ►  y4 

=  [®3)®4)®1)®2)0] 

Sy  =  [0)2/2)  2/3)  2/4)  2/1  ] 

Figure  2.1;  A  valid  schedule 

The  specification  of  what  job  to  execute  at  what  time  is  referred  to  as  a  schedule.  The 
schedule  5  =  [®3,®i)0)®2]  means  that  at  time  =  0  we  start  executing  job  X3.  When  it 
finishes  at  time  =  1,  we  start  job  ij.  Between  time  =  2  and  time  =  3  we  execute  no  job; 
0  will  sometimes  be  referred  to  as  an  idle  job.  At  time  =  3  we  start  executing  job  X2.  This 
job  and  the  schedule  terminate  at  time  =  4. 

For  our  problem,  we  will  need  two  schedules,  Sx  and  Sy,  to  specify  when  the  jobs  are 
executed  on  processors  px  and  py ,  respectively. 

A  pair  of  schedules  {Sx t  ^y)  valid  if:  (1)  Sx  contains  each  X  job  exactly  once  and 
contains  no  V  jobs;  (2)  Sy  contains  each  Y  job  exactly  once  and  contains  no  X  jobs;  (3)  all 
Xi  -*  Vj  constraints  are  satisfied.  The  schedule  (Sa^, 5r)  in  Figure  2.1  is  valid.  Indeed  as  we 
will  have  little  need  to  refer  to  invalid  schedules,  we  will  usually  refer  to  “valid  schedules” 
merely  as  “schedules”. 

The  length  of  a  schedule  is  the  largest  time  when  a  non-0  job  is  executing.  So  the  length 
of  {Sx,Sy)  of  Figure  2.1  is  5  time  units,  since  py’s  last  non-0  job  executes  at  time  =  5. 
Since  there  is  no  valid  {Sx^Sy)  schedule  with  length  <  5,  the  schedule  is  a  minitnum  time 
schedule  or  an  optimum  schedule. 

And  this  is  our  task:  to  find  an  optimum  schedule. 

2.1.1  Viewed  as  a  Graph  Problem 

Frequently,  it  will  be  convenient  to  think  of  the  set  of  constraints  as  a  graph,  rather  than 
merely  as  a  set  of  Xi  y,  constraints.  In  this  graph  we  have  a  node  for  each  job  and  a 
directed  —*  yj  edge  for  each  x,  — ►  yj  constraint.  Figure  2.2  shows  the  graph  for  Figure  2.1. 

For  this  constraint  graph  we  can  buUd  an  n  x  m  adjacency  matrix.  Each  column  will 
correspond  to  an  x,  job  and  each  row  to  a  yj  job.  There  will  be  a  1  in  Xj’s  column  and  yj  s 


Constraints: 
Valid  schedule: 
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Figure  2.2:  Graph  representation 


Xi  X2  ®3  ®4 

yi  X  X  • 

j/2  •  •  X  • 

y3  ■  •  X  • 

VA  •  •  X  X 

Figure  2.3:  Matrix  representation 

row  iff  there  is  an  -»  y,  edge— that  is,  an  Xi  y,  constraint.  Otherwise  the  matrix  entry 
will  be  a  0.  When  we  actually  show  these  matrices,  we  will  show  the  Is  as  xs  and  the  Os 
as  *8.  Figures  2.3  and  2.4  show  two  possible  adjacency  matrices  for  the  graph  in  Figure  2.2. 

Note  that  the  arrangement  in  Figure  2.4  is  lower  triangular— all  the  xs  are  on  or  below 
the  main  diagonal.  This  indicates  that  the  length  5  schedule  Sx  =  [®3,®4,®2,®i,0]  and 
Sy  =  [0,  J/2,!/3,y4,yi]  is  valid.  The  arrangement  in  Figure  2.3  protrudes  one  diagonal  above 
the  main  diagonal.  Thus,  the  following  length  6  schedule  is  valid:  Sx  =  [®i,®2,®3i®4>0>0]. 

Sy  =  [0,0,l/i,»2,y3,l/4]. 

X3  X4  Xj  Xi 
y2  X  •  •  • 

y3  X  • 

1/4  X  X  •  • 

j/i  •  •  X  X 


Figure  2.4:  Alternate  matrix  representation 
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Definition  2.1  An  ordering,  like  a  schedule,  specifies  the  order  of  execution  of  jobs;  but 
unlike  a  schedule  it  does  not  indicate  the  exact  time  of  execution.  Thus,  an  ordering  uses 
no  0  jobs.  For  example,  the  ordering  Oy  =  [yi.!/2,y4iJ/3]  indicates  that  we  first  execute  yi 
then  ya  then  y4  then  ys,  but  it  doesn’t  specify  when  we  will  execute  them.  Thus  an  ordering 
is  simply  a  permutation  of  its  jobs. 

Definition  2.2  A  compact  schedule  is  one  in  which  there  are  no  idle  times  between  jobs. 
There  will  only  be  idle  times  before  and  after  execution.  For  example,  the  schedule  5  = 
[0, 0,  ®i,  xa, . . . ,  Xm,  0, 0]  is  compact. 

Definition  2.3  The  delay  for  a  compact  schedule  {Sx,Sy)  is  the  difference  between  the 
number  of  0  jobs  at  the  beginning  of  the  Sy  schedule  and  the  number  of  0  jobs  at  the 
beginning  of  the  Sx  schedule.  For  example,  for  the  schedule  S  =  {Sx,Sy)  with  Sx  = 
[ii,xa,X3,i4,0,0],  and  Sy  =  [0,0,yi,y2.y3.y4]j  then  delay(5)  =  2-0  =  2.  This  definition 
allows  negative  delays.  For  non-compact  schedules,  we  will  define  the  delay  as  the  difference 
between  the  number  of  0  jobs  before  the  last  Y  job  in  the  Sy  schedule  and  the  number  of 
0  jobs  before  the  first  X  job  in  the  Sx  schedule. 

It  turns  out  that: 

Lemma  2.1  If  5  =  (5x,  5r)  is  valid  and  has  delay(5)  =  k  then  there  is  a  compact  schedule 
S'  =  iS'xtSy)  that  has  delay(5')  =  k. 

Proof:  Look  at  all  idle  Y  jobs  that  execute  before  the  last  non-idle  Y  job.  Move  these 
jobs  to  the  front  of  the  Y  schedule,  yielding  a  compact  schedule  for  Y . 

Look  at  all  idle  X  jobs  that  execute  after  the  first  non-idle  X  job.  Move  these  jobs  to 
the  end  of  the  X  schedule,  yielding  a  compact  schedule  for  X . 

This  resulting  schedule  is  compact  and  has  the  same  delay  as  the  original.  I 

So,  given  an  ordering  of  the  X  jobs  and  an  ordering  of  the  Y  jobs,  we  can  easily  determine 
the  delay  right  away.  Form  the  adjacency  matrix  with  rows  and  columns  arrayed  as  in  the 
given  orderings.  If  the  xs  protrude  above  the  main  diagonal  by  k  diagonals,  then  the  delay 

will  be  fc  -f  1  units  of  time. 

There  is  another  interesting  aspect  to  note  here. 

Lemma  2.2  A  square  matrix  M  is  triangularizable  iff  its  transpose  is  triangularizable. 
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Proof:  This  is  easy  to  see  by  example.  Say  that  our  original  matrix  is: 

Xi  X2  2C3 
1/1  X  • 

y2  X  X  • 

ya  X  X  X  • 

y4  X  X  X  X 

Hence,  the  transpose  is: 

yi  y2  1/3  1/4 

®i  X  X  X  X 

®2  •  XXX 

®3  •  •  X  X 

®4  •  •  •  X 

If  we  merely  reverse  the  order  of  the  ®8  and  the  ys  we  get: 

J/4  1/3  1/2  1/1 

®4  X  • 

®3  X  X  • 

®2  X  X  X 

®1  X  X  X  X 

Thus,  we  can  see  that  if  a  matrix  is  triangularizable,  then  so  is  its  transpose.  Since 
transposition  is  its  own  inverse,  the  converse  is  also  true.  | 

While  this  is  relatively  intuitive  when  we  look  at  the  problem  as  a  matrix  problem,  its 
equivalent  statement  as  a  scheduling  problem  is  not  as  obvious. 

Definition  2.4  The  reverse  of  a  schedule  5,  indicated  by  rev(5),  is  obtained  by  listing  the 
elements  of  the  schedule  in  reverse  order.  That  is,  rev([®i,®2,®3>0])  =  [0j®3.®2»®i]- 

With  this  notation  and  a  slight  extension  of  the  proof,  we  can  state  a  stronger  scheduling 
problem  variation  of  Lemma  2.2. 

Corollary  2.3  For  a  constraint  graph  G  with  |X|  =  |y|,  there  is  a  schedule  5  =  (5jr,  5r) 
with  delay(S)  =  k  iff  there  is  a  schedule  5'  =  (rev(5r),rev(Sjr))  with  delay(50  =  k  for  the 
constraint  graph  G’  obtained  from  G  by  reversing  all  the  arcs  and  interchanging  the  roles 
of  the  X  nodes  and  Y  nodes.  | 


2.1.  THE  BASIC  PROBLEM 

yi 

ya 

y4 

Figure  2.5:  Transposed  version  of  Figure  2.2 

That  is,  instead  of  the  graph  in  Figure  2.2  we  would  have  the  graph  in  Figure  2.5. 
Visually  comparing  these  two  figures,  it  is  not  obvious  that  they  have  the  same  delay;  but 
we’ve  seen  that  they  do.  And  this  delay  is  achieved  in  the  reversed  arc  case  by  reversing 
the  schedules  from  the  non-reversed  case. 

Lastly,  there  is  an  even  stronger  corollary. 

Corollary  2.4  If  (5x,5y)  is  optimal  for  the  square  matrix  M,  then  (rev(5r),rev(Sx))  is 
optimal  for  I 

2.1.2  Implied  Orderings 

We  can  make  a  further  observation.  If  we  are  given  an  ordering  of  the  X  jobs  we  can  easily 
find  an  ordering  of  the  Y  jobs  that  minimizes  the  delay  for  the  given  ordering  of  the  X  jobs; 
that  is,  the  ordering  for  the  Y  jobs  is  implied  by  the  ordering  for  the  X  jobs.  There  are  a 
number  of  equivalent  ways  to  look  at  the  method  for  ordering  Y . 

VIEW  1:  Start  with  the  constraint  graph,  an  ordering  of  the  X  nodes  and  with  a  null 
ordering  of  the  Y  nodes. 

1.  If  any  Y  node  is  isolated,  remove  it  and  append  its  job  to  the  end  of  the  Y  ordering. 

2.  Repeat  until  the  graph  is  empty:  Remove  the  next  X  node  according  to  the  ordering; 
if  any  Y  node  is  isolated,  remove  it  and  append  its  node  to  the  end  of  the  Y  ordering. 

So,  for  Figure  2.2  with  Ox  =  the  iterations  through  the  loop  modify  the 

graph  and  schedule  as  shown  in  Figure  2.6. 
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*4  • - O  U 

Ox“f’'4l  °X“I1 

OY-iva-ys-yii  OY-iy2'y3'yi'y4i 


O  Vi 


- -o  y4 

X-l*i.X4l 
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Figure  2.6:  Snapshots  of  VIEW  1 


2.1.  THE  BASIC  PROBLEM 


®2  ®3  ®1  ®4 

1/1  X  •  X  • 

y2  •  X  • 

1/3-X- 

1/4  •  X  •  X 


l2  ®3  ®l  ®4 
1/2-X- 
1/3-X- 

1/JX-X- 

1/4*  •  X  •  X 


X2  ®3  ®1  ®4 

1/1  X  •  X  • 

I/2  •  X  • 

1/3-X- 

2/4*  •  X  -  X 


X2  ®3  ®1  ®4 

2/5  •  X  -  - 

■  X  -  - 

yi*x-x- 

1/4*  •  X  -  X 


Figure  2.7;  Snapshots  with  fixed  Ox- 


VIEW  2:  Consider  the  adjacency  matrix  with  the  columns  in  Ox  order.  At  the  beginning 
all  rows  are  “unfrozen”.  When  we  freeze  a  row  it  can’t  be  moved  anymore.  A  frozen  row  will 
appear  as  y*.  Iterating  through  the  columns  in  reverse  order,  for  each  column:  compact 
downward  all  unfrozen  rows  with  an  x  in  this  column  and  then  freeze  those  rows.  For 
example,  VIEW  2  gives  the  snapshots  shown  in  Figure  2.7.  At  the  end,  we  can  read  off  the 
rows  from  top  to  bottom,  yielding  Or  =  [2/2  >  1/3  >2/1  >1/4]- 

We  can  also  perform  an  analogous  algorithm  for  a  fixed  Oy  and  thus  find  the  optimum 
Ox  for  that  Oy.  Rather  than  specifying  the  analogous  algorithm  for  a  fixed  Oy,  we  will 
devise  an  equivalent  algorithm  by  using  the  algorithm  for  a  fixed  Ox  3^  3  sub-routine.  This 
equivalent  algorithm  merely  consists  of  transposing  the  matrix,  reversing  the  order  of  the  X 
nodes  and  Y  nodes  and  then  calling  the  algorithm  for  a  fixed  Ox-  The  resulting  snapshots 
are  shown  in  Figure  2.8. 

So  iterating  this  procedure  has  given  us  a  delay  =  1  optimum  schedule.  In  general, 
iterating  this  procedure  will  not  always  produce  globally  optimal  schedules.  For  instance, 
the  fixed  point  shown  in  Figure  2.9  is  non-optimum.  The  optimum  Y  order  for  [®i,  ®2j  ®3)  ®4] 
is  [1/1 1 1/2 >1/3)  1/4]  and  the  optimum  X  order  for  [1/1,2/21 1/3)  1/4 ]  is  [®i,®2,®3,®4]-  These  have  a 
delay  of  2;  but  the  schedule  with  Sx  =  [®4,®3,®2,®i]  and  Sy  =  [2/4,l/3,l/2,2/i]  gives  us  a 
lower  triangular  matrix — that  is,  a  schedule  with  a  delay  of  only  1. 
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Figure  2.8:  Snapshots  with  fixed  Oy. 
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Figure  2.9:  Non-optimum  fixed  point 
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®1  12  ®3  ®4 

yi  X  ^ 

y2■x■ 

ys  X  X  •  X 

1/4  •  •  X  • 

ys  X  •  XX 

Figure  2.10:  Mi  satisfies  lt(Mi,2)  but  not  lt(Mi,l) 

Xl  l2  ®4 

J/i  X  •  •  • 

2/2  •  X  • 

2/4  •  •  X  • 

2/3  X  X  ■  X 

2/5  X  •  XX 

Figure  2.11:  M2  satisfies  lt(M2,l) 


2.2  More  Basics 

We’ve  been  relatively  informal  so  far.  But  now  we  will  need  to  get  more  mathematical  and 
precise.  For  this  let  us  concentrate  on  the  matrix  formulation. 

An  n  X  m  matrix  M  is  lower  triangular  below  the  fc-diagonal  if  all  the  Is  are  strictly 
below  the  fc-th  diagonal,  where  the  mmn  diagonal  is  labelled  the  0-th.  Let’s  define  a  function 
which  tells  us  if  a  matrix  is  lower  triangular  below  the  fc-th  diagonal. 

Definition  2.5  For  an  n  X  m  matrix  M,  let  the  boolean  function  lt(M,t)  be  true  iff  each 
element  of  M[r][c]  satisfies  c  —  r  >  t  ^  M[r][c]  =  0. 

The  matrix  Mi  in  Figure  2.10  satisfies  lt(Mi,  2)  but  not  lt(Mi,  1).  When  we  interchange 
rows  2/3  and  2/4  yielding  the  matrix  M2  in  Figure  2.11,  then  M2  satisfies  lt(M2, 1). 

Hence,  if  the  order  of  the  rows  indicates  the  ordering  of  the  Y  jobs  in  a  particular 
schedule  and  the  order  of  the  columns  indicates  the  ordering  of  the  X  jobs  in  the  same 
schedule,  then  the  delay  of  the  schedule  is  equal  to  the  smallest  t  such  that  lt(M,  t)  is  true. 

What  we  are  ultimately  after,  though,  is  the  minimum  delay  over  all  permutations  of 
the  rows  and  columns. 
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Definition  2.6  Let  the  boolean  function  plt(M,t)  be  true  iff  there  is  a  matrix  M',  derived 
from  M  by  permutations  of  rows  and  columns,  such  that  lt(il(f  ,t)  is  true. 

So  for  the  matrix  Mi  in  Figure  2.10  and  the  matrix  M2  in  Figure  2.11,  plt(Mi,  1)  and 
plt(M2,l).  In  general,  if  Mi  is  a  permutation  of  M2,  then  plt(Mi,t)  =  plt(M2,t). 

With  these  new  definitions,  our  transposition  result  in  Corollary  2.3  can  be  stated  as: 

Corollary  2.6  For  a  square  matrix  M,  plt(M,  i)  <=>•  plt(M^,t).  I 

This  can  be  generalized  for  rectangular  matrices. 

Corollary  2.6  For  an  n  x  m  matrix  M,  plt(M,t)  <=>  plt(M’^,t  +  n  -  m).  I 

Corollary  2.4  can  also  be  generalized  for  rectangular  matrices. 

Corollary  2.7  If  (Sx,Sr)  is  optimal  for  the  n  x  m  matrix  M,  then  (rev(5y),rev(5A-))  is 
optimal  for  I 

For  any  given  matrix  M,  we  will  refer  to  the  smallest  i  such  that  ph(M,i)  holds  as  the 
overhang  of  the  matrix.  This  will  occasionally  be  written  as  overhang(M ). 

Alternately,  we  could  define 

overhang(M)  =  mindelay(A  Af5) 

AfB 

where  A  and  B  are  row  and  column  permutation  matrices,  respectively. 

Let’s  explore  the  following  problem. 

Permutable  into  Lower  Triangular  (PLTi) 

INSTANCE:  An  n  X  m  matrix  M  and  a  number  i. 

QUESTION:  Is  plt(Af,t)  true? 

Thus,  PLTi  is  simply  the  triangularization  problem.  PLTo  is  similar,  except  that  Is 
aren’t  allowed  on  the  diagonal. 

The  first  obvious  question  is,  how  are  these  problems  related  to  each  other  for  various 
values  of  t? 

Let’s  start  by  taking  an  arbitrary  n  x  m  matrix  M  and  adding  to  it  a  new  column 
®m+i  with  all  Is.  Call  the  new  matrix  M'.  If  we  do  this  to  Mj,  we  get  the  matrix  M^ 
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*1  l2  ®3  ®4  ®5 

yi  X  ■  •  •  X 

J/2  •  X  •  •  X 

y3  •  •  X  •  X 

y4  X  X  •  XX 

y5  X  •  XXX 

Figure  2.12: 

Is  Ii  ®2  3:3  14 

2/1  X  X  • 

y2  X  •  X  • 

2/3  X  •  -x- 

3/4  X  X  X  •  X 

1/5  X  X  •  XX 

Figure  2.13:  plt(M2,2) 

shown  in  Figure  2.12.  By  changing  the  column  order  as  shown  in  Figure  2.13  we  see  that 
plt(Jlf2,l)  plt(M2,2).  Our  claim  is  that  for  any  choice  of  M,  plt(Af,  1)  <=^ 

plt(M',2). 

This  is  fairly  straightforward  to  see.  If  we  are  given  some  delay  =  1  schedule  (5x,5y) 
for  M,  then  it  is  clear  that  adding  the  new  column  to  the  beginning  oISx  yields  a  delay  =  2 
schedule  for  M'. 

Similarly,  if  we  are  given  some  schedule  for  M',  delay  =  2,  then  it  is 

clear  that  by  merely  dropping  out  the  new  column,  we  get  a  schedule  for  M  with  delay  =  1 , 
because  x^+i  must  be  executed  before  any  Y  job  and  now  the  Y  jobs  can  all  be  executed 
one  time  unit  earlier. 

So  for  any  Af,  we  can  find  a  matrix  M'  such  that  plt(A/,  1)  > — plt(A/  ,2).  This  tells 
us  that  PLTi  :<r  PLT2.  More  generally,  this  same  technique  tells  us  that: 

Lemma  2.8  PLTi  PLTi+i,  for  t  >  0.  I 

Let’s  formalize  some  of  the  facts  that  we  implicitly  used. 

Lemma  2.0  If  there  is  some  job  ijt  with  all  Is  in  its  column,  then  there  is  an  optimal 
schedule  that  has  Xk  first  in  Sx- 
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Xi  X2  ®3 

J/l  X  X  •  •  X 

J/2  •  X  X  X 

ya  X  X  ■  ■  X 

y4  ■  X  X  X  • 

ys  •  X  X  •  • 

Figure  2.14:  Example  of  domination 

Proof:  Assume  without  loss  of  generality  that  there  is  an  optimal  schedule  with  Sx  = 
[ii,  ®2i  •  •  •  > ®m]-  Consider  changing  the  schedule  to  5^  =  [ijt,  xi,  ®2>  ■  •  •  >  ®fc+i>  •  •  •  >  ®m] 
and  leaving  5y  =  5y .  Since  no  Y  job  could  have  started  before  time  + 1,  this  new  schedule 

is  still  valid.  | 

Similarly,  we  can  establish  a  dual  result, 

Lemma  2.10  If  there  is  some  job  yk  with  all  Is  in  its  row,  then  there  is  an  optim^  schedule 
which  has  j/fc  last  in  5y. 

Proof;  Instead  of  using  a  proof  analogous  to  that  of  Lemma  2.9,  let’s  use  what  we  know 
about  transposition.  Look  at  Now,  by  Lemma  2.9,  we  know  that  there  is  an  optimum 
schedule,  (5y,  Sx),  for  M’'  with  y*  first  in  5y.  By  our  transposition  result  of  Corollary  2.7, 
(rev(5x),rev(5y))  is  optimum  for  =  M.  Clearly  this  schedule  uses  y*  last.  | 

This  idea  can  be  extended.  We  will  say  that  a  column  of  a  matrix  dominates  another  if 
the  first  one  has  a  1  in  every  row  that  the  second  one  does. 

For  example,  in  Figure  2.14  column  X2  dominates  columns  xi,  X3,  X4,  and  X5.  Column 
X3  dominates  column  X4.  Columns  xi  and  X5  dominate  each  other.  Clearly  two  columns 
can  dominate  each  other  only  if  they  are  equal. 

This  allows  us  to  generalize  Lemma  2.9. 

Lemma  2.11  If  column  Xj  dominates  column  x*,,  then  there  is  an  optimum  schedule  with 
job  Xj  preceding  job  x*. 

Proof;  Similar  idea  to  that  in  Lemma  2.9.  Assume  we  have  an  optimum  schedule 
{SxjSy)  with  Xjfe  before  Xj.  Without  loss  of  generality,  we  can  assume  that  k  <  j  and  that 
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Sx  =  [®i)®2)  •  •  •  )®m]'  Let  5^  =  [ii, .  •  -y^k-it^k+iy  •  -  ®i+i>  •  •  Oiice  again 

it  is  clear  that  (5^,  5r)  is  a  valid  schedule.  | 

We  can  define  the  same  idea  for  rows.  In  Figure  2.14  rows  y2  and  y4  dominate  each 
other  amd  row  y^.  Transposition  clearly  gives  us: 

Corollary  2.12  If  the  row  for  job  yj  dominates  the  row  for  job  yk,  then  there  is  an  optimum 
schedule  with  job  yj  following  job  yk-  I 

We  say  that  a  column  strictly  dominates  another  if  it  dominates  the  other  and  they  are 
unequal.  Thus,  in  Figure  2.14,  column  sci  does  not  strictly  dominate  *5  nor  vice  versa.  But 
column  X2  strictly  dominates  columns  ii,  13, 14,  and  X5  and  column  X3  strictly  dominates 
columns  X4. 

By  applying  the  same  technique  from  Lemma  2.11  iteratively,  we  get  an  even  stronger 
result: 

Lemma  2.13  There  is  an  optimum  schedule  such  that 

1.  For  every  pair  of  jobs  such  that  Xj  strictly  dominates  Xk,  job  Xj  precedes  Xfc. 

2.  For  every  pair  of  jobs  such  that  yj  strictly  dominates  yfc,  job  yj  follows  yk- 

3.  For  every  pair  of  X  jobs  that  have  equal  columns,  every  intervening  X  job  has  an 
equal  column. 

4.  For  every  pair  of  Y  jobs  that  have  equal  rows,  every  intervening  Y  job  has  an  equal 
row. 

I 


Also  note  that  given  any  optimal  schedule  that  doesn’t  satisfy  these  conditions,  we  can 
modify  it  to  satisfy  them  in  polynomial  time. 

While  all  of  this  is  interesting  and  will  be  useful  later,  let’s  return  to  our  immediate  task 
of  exploring  the  relationship  between  the  PLTi  problems.  We  know  that  PLTi  :<r  PLTi+i 
if  t  >  0.  What  if  t  <  0? 

Lemma  2.14  If  plt(Af,i)  for  i  <  0,  then  M  must  have  at  least  |t|  +  1  rows  of  all  Os.  I 
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So  assume  we  are  trying  to  solve  plt(Af,  t)  with  t  <  0.  Since  there  is  at  least  one  row  of 
all  Os,  we  create  matrix  M'  by  deleting  one  such  row. 

Clearly  if  we  have  a  delay  =  t  schedule  for  M,  then  we  can  obtain  a  delay  =  t  +  1 
schedule  for  M'.  The  reverse  is  just  as  clear:  If  we  have  a  delay  =  t  +  1  schedule  {S'x,S{r) 
for  M'  then  we  can  obtain  a  delay  =  i  schedule  for  M  by  re-adding  the  deleted  row  at  the 
beginning  of  Sy.  Thus  plt(M',  i-h  1)  =  plt(M,  t). 

So,  now  we’ve  shown  PLT^  PLTi.j-1  for  all  i  <  0  and  we  already  showed  in  Lemma  2.8 

that  it  was  true  for  i  >  0.  Hence: 

Lemma  2.15  PLTj  -^-p  PLTj+i  for  all  t.  | 

Similarly,  we  can  show  that 
Lemma  2.16  PLTj+i  PLTj  for  all  t. 

Proof:  Start  with  a  matrix  M'  and  the  question  plt(Jl!f^t  -h  1).  Create  M  by  adding  a 
row  of  Os.  Our  argument  from  the  earlier  paragraph  still  holds,  telling  us  that  plt(Af  ,  i-l-1)  = 
plt(M,  t)  and  hence  PLT^+i  PLTj.  I 

Consequently  aU  the  problems  PLTi  have  equivalent  complexity  up  to  polynomial  fac¬ 
tors,  for  fixed  t  . 

2.3  Variants 

There  is  still  much  more  use  we  can  make  of  our  dominance  lemma.  Lemma  2.13.  We  can 
use  it  to  show  that  our  original  problem  is  equivalent  to  a  more  general  variant.  This  general 
version  allows  x  — ^  x  and  y  — ^  y  precedence  constraints  and  non-umt  execution  times. 

2.3.1  Non-unit  execution  times 

Let’s  start  with  non-unit  execution  times.  In  this  case  we  associate  a  positive  integer  weight 
or  execution  length  with  each  node/job  as  in  Figure  2.15. 

If  we  iteratively  apply  the  following  modification  to  each  X  node  and  Y  node  then  we 
will  have  an  instance  of  our  original  problem:  If  a  node  is  of  weight  k,  then  replace  it  with  k 
1-unit  nodes,  each  of  which  has  arcs  to/from  the  same  nodes  as  the  original  weighted  node 
did.  See  Figure  2.16  for  an  example  of  the  transformation. 
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Figure  2.16:  Mapping  weighted  times  to  unit  execution  times 


18 


CHAPTER  2.  BASICS  AND  WARMUPS 


We  will  refer  to  11,1,  xi,2  and  ®i,3  as  the  sub-nodes oi  xi.  Obviously  a  delay  =  t  schedule 
for  the  weighted  version  yields  a  delay  =  t  schedule  for  the  unit  weight  version.  What  we 
must  show  is  that  any  delay  =  i  schedule  for  the  unit  weight  graph  can  be  modified  to  yield 
a  delay  =  i  schedule  for  the  original  graph.  The  dominance  lemma  does  all  the  work  for  us. 

Assume  we  have  a  delay  =  i  schedule  of  G'.  By  Lemma  2.13  we  can  easily  construct  a 
schedule  where  all  nodes  of  equal  dominance  are  adjacent.  For  example,  given  the  following 
schedule  for  the  modified  version  of  the  graph  in  Figure  2.15 


([*1,1,  *2.1,  ®5,1,  *2.2,  *3,2,  *4,1,  *3,1, 0],  [0, 1/1,2,  J/l,3, 1/4,1,  3/1,1,  J/2,1, 3/3,1, 1/3,2]) 


dominance  tells  us  that  we  can  modify  it  to 


®5,1,  ®2.1,  ®2,2,  a^3,2,  ®3,1, 0],  [0,  2/1,2,  2/1.3,  2/l,l ,  ^4,1 , 2/2.1 , 2/3.1 , 2/3,2]) 

After  this,  the  only  case  when  a  set  of  sub-nodes  aren’t  adjacent  is  if  two  of  the  weighted 
nodes  had  exactly  the  same  neighbors — as  happened  with  ®3  and  X4  in  the  example. 

But,  clearly  in  such  a  case,  we  can  re-order  the  nodes  yielding: 

([a^l.l,  ®5,1,  ®2,1,  ®2.2,  ®4.1,  ®3,2,  ®3,1, 0],  [0,  2/1,2,  2/1.3,  2/l,l>  ^4.1, 2/2.1,  2/3.1, 2/3.2]) 

Since  all  sub- jobs  are  adjacent,  the  weighted  node  schedule 

([®1,  «5,  ®2,  ®2,  ®4,  ®3,  ®3,  0],  [0,  2/1,  2/1,  2/1, 2/4,  2/2, 2/3,  2/3]) 

is  clearly  valid.  So,  we  can  transform  any  delay  =  i  schedule  of  the  unit-weight  version  into 
a  delay  =  i  schedule  of  the  weighted  version, 

2.3.2  Allowing  x  — >  x  Constraints 

Let’s  look  at  what  happens  to  the  problem  when  we  allow  x  x  constraints.  Our  basic 
claim  is  that  we  can  replace  all  the  ®  — ♦  ®  links  by  a  set  of  ®  — >  y  links  such  that  any 
solution  for  the  modified  problem  can  be  converted  into  a  solution  for  the  original  problem 
and  vice-versa.  Figure  2.17  has  an  example. 

The  way  we  perform  this  transformation  is  as  follows: 

1.  Find  the  transitive  closure  of  all  the  constraints  and  add  all  those  edges  to  the  graph. 


2.  Throw  away  all  the  ®  — ►  ®  edges. 
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original 


modified 


Figure  2.17:  Removing  x  —*  x  edges:  valid 

original  modified 


It  is  clear  that  taking  the  transitive  closure  has  no  effect  on  the  solutions:  any  valid 
schedule  before  is  a  valid  schedule  afterward  and  vice-versa. 

What  we  must  show  is  that  it  is  permissible  to  throw  away  the  x  x  edges  at  this 
point.  Our  difficulty  is  shown  in  Figure  2.18.  It  has  no  valid  schedule  before  discarding  the 
edges,  but  it  does  have  one  afterwards. 

Hence,  we  will  show  that  if  the  original  has  a  valid  schedule,  then  any  delay  =  i  schedule 
of  the  original  graph  can  be  converted  into  a  delay  =  i  schedule  of  the  modified  graph. 

Again,  it  is  obvious  that  any  delay  =  t  schedule  of  the  original  graph  is  a  delay  =  i 
schedule  of  the  modified  graph. 

We  need  to  show  the  converse.  Say  we  are  given  a  schedule  of  the  modified  graph.  To 
get  a  valid  delay  =  i  schedule  of  the  original  graph  simply  perform  the  following  routine: 

1.  Apply  the  dominator  method  to  it,  yielding  a  schedule  (5^,  Sy)  satisfying  Lemma  2.13. 

2.  If  any  columns  of  the  modified  graph  are  equal,  their  x  nodes  will  all  be  adjacent 
to  each  other.  Re-order  these  columns  in  topological  order  according  to  the  x  x 
ordering  of  these  nodes  in  the  original  graph. 

To  see  correctness:  If  Xj  precedes  Xj  in  the  original  graph  then  column  Xj  dominates 
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column  Xj  in  the  modified  graph.  There  are  two  cases.  In  one  case,  column  strictly 
dominates  column  Xj,  and  hence  will  already  be  before  Xj  in  5^.  In  the  other  case,  Xi 
and  Xj  have  equal  columns  and  there  again,  Xi  will  be  before  Xj  because  of  the  topological 
ordering  in  step  (2)  of  our  routine. 

2.3.3  All  Together 

What  about  allowing  y y  arcs?  Although  transposition  as  we’ve  proven  it  so  far  doesn’t 
automatically  imply  this  from  the  results  about  the  x  x  arcs,  it  can  be  easily  extended 
to  do  so.  That  is 

Lemma  2.17  Given  an  optimum  schedule  {Sx,Sy)  for  a  graph  G  with  x  -*  y  and  x  -►  x 
arcs,  (rev(5r),rev(5A-))  wiU  be  an  optimum  schedule  for  the  graph  with  the  directions  of 

the  edges  reversed.  | 

But  ultimately,  we  claim  not  merely  the  three  individual  results  but  rather  the  compo¬ 
sition  of  them.  We  need  to  transform  a  graph  with  x  x  edges,  y -*  y  edges  and  non-unit 
weights  into  an  equivalent  one  without  x  ->  x  edges,  y  -*  y  edges  and  non- unit  weights. 
Our  combined  transformation  is 

Given  a  graph  G  with  i  x,  i  -+  y  and  y  —  y  arcs  and  non-unit  weights. 

1.  Find  the  transitive  closure  of  G. 

2.  Discard  all  x  — ♦  x  and  y  y  edges. 

3.  Split  up  each  node  with  weight  k  into  k  unit-weight  nodes. 

Assuming  we  have  a  valid  schedule  for  the  modified  problem,  we  can  convert  it  into  a 
valid  schedule  for  the  original,  if  one  exists,  by: 

1.  Apply  the  dominance  lemma  to  both  rows  and  columns. 

2.  For  each  set  of  equal  columns  (rows): 

(a)  Re-order  them  so  that  all  sub-nodes  of  an  original  node  are  adjacent. 

(b)  Combine  those  nodes  into  a  single  node  of  weight  k. 

(c)  Re-order  the  weighted  nodes  to  obey  the  original  topological  ordering. 
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In  going  from  the  schedule  for  the  modified  graph  to  the  schedule  for  the  original  graph, 
all  the  modifications  are  delay-non-increasing. 

This,  along  with  the  fact  that  any  schedule  for  the  original  graph  can  be  transformed 
into  a  schedule  for  the  modified  graph  with  the  same  delay  tells  us  that  the  two  problems 
are  equivalent. 

2.3.4  Contractor  Cash-Flow 

A  non-computer  scenario  where  this  problem  is  useful  involves  a  contractor  who  is  working 
on  a  large  construction  project.  He  is  paid  portions  of  his  earnings  for  reaching  certain 
milestones  in  the  construction:  $A  when  the  foundation  is  laid,  $B  when  the  frame  is  up, 
$C  when  the  plumbing  and  electrical  wiring  are  installed.  Naturally  there  are  restrictions 
on  the  order  he  completes  various  tasks  —  you  can’t  install  wiring  in  thin  air.  Lastly,  the 
contractor  has  to  make  payments  for  materials  and  labor  associated  with  various  tasks. 

The  contractor  doesn’t  have  a  large  financial  reserve,  and  he  needs  to  complete  the 
project  without  running  out  of  cash.  How  should  he  order  the  tasks  so  as  to  minimize  the 
amount  of  his  own  money  that  he  needs  to  use  to  pay  for  the  intermediate  expenses?  He 

doesn’t  want  his  cash  balance  to  go  below  0. 

This  problem  corresponds  to  allowing  x  — ►  i  constraints  and  weighted  nodes.  Every  x 
node  represents  a  task  to  be  completed  and  its  weight  represents  the  cost  in  material  and 
labor  that  the  contractor  must  put  up  to  complete  that  task.  The  x  — »  x  arcs  encode  the 
precedence  relations  among  the  tasks.  There  is  one  isolated  y  node  with  a  weight  equal  to 
his  initial  cash  balance.  The  other  y  nodes  represent  pay-off  conditions,  which  may  be  any 
conjunction  of  tasks  —  if  completing  tasks  xi  and  X3  means  that  the  contractor  is  paid  $u;, 
then  we  create  a  y  node  with  weight  w  and  precedence  arcs  to  it  from  xj  and  X3. 

The  contractor  runs  out  of  cash  iff  overhang  >  0. 

At  this  point  we  should  make  the  technical  note  that  the  sizes  of  the  payoffs  must 
be  limited  by  some  constant  or  limited  by  some  multiple  of  log  n,  for  the  problem  to  be 
equivalent  under  polynomial  reductions  to  PLTo. 


2.4  Allowing  y  x  Constraints 

The  next  natural  question  is  what  happens  when  we  allow  all  four  types  of  arcs:  x  -♦  y, 
X  -*  X,  y  -*  y,  and  y  -»  x.  This  turns  out  to  be  a  well-studied  problem. 
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2.4.1  Microcode  Compaction  for  Very  Long  Instruction  Word  Computers 

One  approach  to  increasing  parallelism  within  a  processor  is  the  use  of  very-long  instruction 
words,  or  VLIWs  [FisTQ]  [Veg82]  [Lam87]  [FERN84]  [Tou84]  [LDSM80].  A  conventional 
processor  will  read  in  a  machine- code  instruction  such  as  *  ADD  R1 ,  R2  which  it  then 
decodes  and  maps  to  a  series  of  microcode  instructions.  These  microcode  instructions  are 
extremely  low-level.  Each  instruction  is  made  up  of  micro-operations:  put  the  contents  of 
R1  out  onto  internal  bus  B;  read  the  contents  of  internal  bus  A  into  the  ALUj  and  so  on. 

By  their  nature,  the  micro-operations  for  different  components  of  the  processor  can 
be  executed  in  parallel.  Due  to  the  coarse  nature  of  the  instructions  like  ‘  ADD  R1 »  R2 
however,  large  portions  of  the  processor  may  be  idle  at  any  time.  The  VLIW  approach 
says:  instead  of  the  program  being  written  as  machine  code,  let’s  have  it  written  as  micro¬ 
code.  Let’s  have  the  compiler  translate  our  high-level  language  into  a  series  of  micro-code 
instructions.  Since  micro-code  specifies  so  much  detail,  it  requires  much  longer  instruction 
words — and  hence  the  name  VLIW . 

Obviously  to  make  use  of  this  new  ability  we  don’t  just  translate  the  source  code  into  the 
same  microcode  instructions  that  the  machine  code  gets  translated  to  and  then  concatenate 
them  all.  This  is  a  first  step,  but  it  doesn’t  yield  any  improvements.  Instead,  we  want  to 
compact  these  micro-ops  together  so  that  some  of  them  will  be  executed  in  parallel.  Natu¬ 
rally,  the  micro-ops  will  have  precedence  constraints  between  them.  The  micro-instructions 
typically  have  several  fields  and  each  micro-op  must  go  into  a  specific  field.  For  example, 
if  there  are  exactly  two  fields,  and  we  represent  instructions  that  must  go  in  the  first  field 
with  circles  and  instructions  that  must  go  in  the  second  field  with  squares,  then  we  might 
have  a  precedence  graph  like  the  one  in  Figure  2.19. 

So  we  can  now  define  the  micro-code  compaction  problem  for  VLIWs. 

Micro-Code  Compaction  for  VLIWs  (MC-COMPACTION) 

INSTANCE:  Set  O  of  unit-time  micro-operations,  number  k  of  classes  of  operations,  class 
number.  1  <  c(o)  <  k  for  each  o€  0,  partial  order  on  0  and  deadline  t. 

QUESTION:  Is  there  a  schedule  of  the  micro-operations  that  obeys  the  partial  order,  never 
schedules  two  micro-operations  with  the  same  value  of  c{o)  for  the  same  time  slot  and  is  of 
length  <  t? 

This  problem  with  A:  =  2  is  equivalent  to  our  scheduling  problem  allowing  x  x,x  y, 
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Figure  2.19:  Micro-operations  with  dependencies 


y  ^  y  and  y  x  arcs. 

This  problem  is  i\rP.complete.  This  was  shown  by  Goyal  [Goy76,  Goy77].  Since  we  will 
want  to  modify  his  proof,  let’s  look  at  it  in  detail. 

He  reduces  the  m-PROCESSOR  SCHEDULING  problem  to  it.  The  m-PROCESSOR 
SCHEDULING  problem  is  known  to  be  iVP-complete  [U1176]. 

m-PROCESSOR  SCHEDULING 

INSTANCE:  Set  X  of  unit-time  tasks,  number  m  of  processors,  partial  order  on  X,  and 
deadline  t. 

QUESTION:  Is  there  a  schedule  of  the  n  =  1X|  tasks  that  obeys  the  partial  order,  never 
schedules  more  than  m  tasks  for  the  same  time  slot  and  is  of  length  <  f  ? 

This  problem  can  be  reduced  to  MC-COMPACTION  with  A;  =  2. 

We  start  with  the  precedence  graph  for  an  instance  of  m-PROCESSOR  SCHEDULING 
such  as  that  in  Figure  2.20. 

Split  each  node  into  two  nodes— an  in-node  and  an  out-node.  Replace  any  arc  Xi  i, 
with  an  xf  arc.  Add  the  arcs  Figure  2.21  shows  the  result. 

The  out-node  jobs  will  be  required  to  execute  on  processor  px  and  the  in-node  jobs  will 
be  required  to  execute  on  processor  py  •  Lastly,  we  will  add  an  enforcing  graph  consisting  of 
the  filled  nodes  in  Figure  2.22.  The  dashed  nodes  represent  the  slots  av^able  for  execution 
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Figure  2.22:  Enforcing  graph  for  m  =  3 


of  the  other  jobs. 

We  ask  the  question,  can  the  transformed  jobs  be  scheduled  to  complete  in  2n  time 
steps?  The  answer  is  yes  iff  the  m-PROCESSOR  PROBLEM  could  have  been  scheduled  in 
n/m  time  steps. 

This  is  straightforward  to  see.  The  only  out-nodes  that  can  execute  at  times  m  -f  1  to 
2m  are  those  corresponding  to  the  in-nodes  that  were  executed  from  time  1  to  m.  This  will 
then  make  available  for  execution  at  time  2m  -f- 1  all  those  jobs  that  are  released  by  running 
the  in- jobs  we  ran  at  time  1  to  m.  Thus,  any  schedule  for  the  m-PROCESSOR  PROBLEM 
can  be  transformed  into  a  schedule  for  MC-COMPACTION. 

To  see  the  other  direction,  we  merely  have  to  note  that  there  is  no  way  to  cheat:  (1) 
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No  out-node  can  be  executed  before  its  corresponding  in-node;  (2)  if  an  in-node  is  executed 
during  some  time  range  2a77i  +  1  to  (2<i  +  1)^>  then  all  out-nodes  that  (edgewise)  precede 
it  must  have  been  executed  before  2aTn  +  1  and  hence  all  in-nodes  of  jobs  that  (edgewise) 
precede  it  must  have  been  executed  before  2(a  —  l)Tn  +  1. 

There  is  an  even  tighter  result  that  shows  that  the  MC-COMPACTION  remains  NP- 
complete  even  when  the  precedence  graphs  are  limited  to  being  chains  [LLMS87].  That 
reduction  is  from  3-partition. 

But  we  want  to  limit  the  graphs  in  a  different  direction.  We  want  to  know  the  minimum 
degree  of  y  x  arcs  we  can  allow  while  still  having  the  problem  be  TVP-complete. 

Our  construction  shows  that  allowing  one  y  x  and  one  y  y  out  of  each  y  node  and 
one  X  ^  a;  out  of  each  X  node  and  an  unlimited  number  of  x  ^  y  arcs  makes  the  problem 
i\rP-complete.  What  if  we  didn’t  want  to  allow  y  y  and  x  x  arcs?  We  could  try  the 
dominance  methods  we  tried  in  Section  2.3.  But  they  don’t  carry  over. 

Alternately  we  could  replace  the  enforcing  graph  of  Figure  2.22  with  that  of  Figure  2.23. 
That  is,  for  the  enforcing  graph,  each  X  node  has  an  edge  to  each  Y  node  in  the  following 
string  ofY  nodes  and  vice  versa.  Clearly  the  enforcing  structure  is  equivalent  to  the  original 
and  uses  no  x  x  and  y  — >  y  arcs. 

Can  we  reduce  the  number  of  y  — >  x  arcs?  Yes,  the  enforcing  structure  in  Figure  2.24 
uses  at  most  two  y  — ►  x  arcs  per  Y  node. 

It  seems  hard  to  get  WP-completeness  with  at  most  one  y  x  arc  per  Y  node.  We  will 
say  more  about  this  in  Section  2.5. 

However,  it  turns  out  that  we  can  get  J\rP- completeness  with  at  most  one  y  — ►  x  arc  per 
Y  node  and  one  x  x  arc  per  X  node.  This  is  done  by  using  the  enforcing  structure  in 
Figure  2.25. 

It  is  not  clear  what  happens  when  we  allow  one  y  x  arc  and  one  y  y  arc  per  Y 
node — or  even  k  y  y  arcs  per  Y  node. 

2.5  Pipeline  Scheduling 

A  special  case  of  allowing  one  y  x  arc  per  y  node  is  related  to  the  problem  of  pipeline 
scheduling  [Gro83]  [HG83]  [GM86]. 

Like  very  long  instruction  words,  pipelining  is  a  technique  to  keep  less  of  a  computer 
processor  idle  at  any  time,  thus  increasing  the  effective  execution  rate  of  programs. 


28 


CHAPTER  2.  BASICS  AND  WARMUPS 


Figure  2.24:  Enforcing  graph  for  m  =  4  with  at  most  two  y  x  edges  per  Y  node 
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Figure  2.25:  Enforcing  graph  for  m  =  3  with  at  most  one  y  -* 
most  one  x  —*  x  edge  per  X  node 
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In  a  pipelined  computer,  the  processor  is  split  up  into  a  series  of  stages.  During  an 
instruction’s  execution,  it  passes  from  one  stage  to  the  next,  freeing  up  the  previous  stage 
for  the  next  instruction.  The  hope  is  that  each  stage  of  the  processor  can  always  be  kept 
busy  processing  some  instruction. 

With  pipelining,  a.  new  complication  sets  in.  Sometimes  an  instruction  must  be  pre- 
vented  from  starting  until  after  some  other  instruction  is  completely  precessed  through  all 
stages  of  the  pipeline.  This  leaves  intervening  execution  slots  that  should  be  fUled. 

Hence,  one  variant  that  arises  for  a  fc-stage  pipeline  is 

PIPELINE-fc 

INSTANCE;  Set  X  of  unit-time  instructions,  number  k  of  pipeline  stages,  partial  order 
on  X,  and  deadline  t. 

QUESTION:  Is  there  a  schedule  of  the  n  =  |Ar|  tasks  of  length  <  t  such  that  for  every  pair 
of  tasks  such  that  Xi  precedes  xj  in  the  partial  order,  the  instruction  Xi  starts  at  least  k 
time  units  before  Xjl 

If  A:  =  1,  then  we  merely  have  the  normal  DAG  constraints  and  any  topological  order  is 
optimal. 

The  A:  =  2  case  turns  out  to  be  equivalent  to  the  following  problem: 

YX-MATCH 

INSTANCE:  Two  processors  px  and  py,  set  of  unit-time  jobs  X,  set  of  unit-time  jobs  Y 
with  |Y|  =  |X|  =  n,  unlimited  precedence  constraints  of  the  form  Xj  —*  yj,  exactly  one 
y  — >  X  precedence  constriant  coming  out  of  each  Y  node,  exactly  one  y  >  x  precedence 
constriant  going  into  each  X  node,  and  a  time  bound  t. 

QUESTION:  Is  there  a  schedule  of  time  <  t  -|-  1  satisfying  the  precedence  constraints  and 
such  that  the  X  jobs  are  only  scheduled  on  processor  px  and  the  Y  jobs  are  only  scheduled 
on  processor  py? 

The  transformations  are  quite  simple.  To  go  from  the  constraint  graph  of  an  instance 
of  YX-MATCH  (shown  on  the  left  in  Figure  2.26)  to  the  constraint  graph  for  an  instance 
of  PIPELINE-2  (shown  on  the  right  in  Figure  2.26),  simply  merge  together  nodes  Xi  and 
yj  wherever  there  is  an  edge  yj  — >  x^. 
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Figure  2.26:  Transforming  YX- MATCH  to  PIPELINE-2 


The  transformation  from  an  instance  of  PIPELINE-2  to  YX-MATCH  is  simply  the 
reverse  of  this:  split  each  node  into  an  1“  and  an  Any  edge  Xi  -*  xj  gets  mapped  to 
an  xf“‘  -»  edge.  Also  add  the  edges  xj"  xf“‘.  The  in-nodes  must  execute  on  py  and 

the  out-nodes  on  px  • 

Merely  stating  the  transformations  does  not  prove  that  a  solution  to  one  problem  trans¬ 
lates  into  a  solution  for  the  other. 

Lemma  2.18  YX-MATCH  and  PIPELINE-2  are  equivalent  under  polynomial-time  reduc- 
tions. 

Proof:  Assume  that  we  are  given  an  instance  of  YX-MATCH,  that  we  transform  it  into 
an  instance  of  PIPELINE-2  and  that  there  is  a  schedule  of  time  <  t  for  the  PIPELINE-2 
instance.  Without  loss  of  generality,  let  our  schedule  for  PIPELINE-2  be  5  =  [®i)-  •  - 
(x»  may  refer  to  an  idle  job).  Then  our  schedule  YX-MATCH  is  Sy  — 

and  5^  =  where  x^^'  and  x^  are  the  X  and  Y 

nodes  that  were  originally  collapsed  together  to  create  Xj.  Since  for  S\  xj*'  precedes  x^  , 
then  all  y  — ►  X  edges  are  satisfied.  For  any  edge  x*  Xj ,  there  must  be  a  job  in  5  between 
Xi  and  Xj]  hence  there  is  at  least  one  job  in  between  xj*'  and  xj^ .  Thus,  x®**^  is  executed 
before  x^  and  hence  all  x  y  edges  are  satisfied.  Thus  we  have  a  schedule  of  time  <  t  +  1 
for  our  original  instance  of  YX-MATCH. 

Assume  that  we  are  given  an  instance  of  PIPELINE-2,  that  we  transform  it  into  an 
instance  of  YX-MATCH  and  that  there  is  a  schedule  {Sx,Sy)  of  time  <  t  -fl  for  the 
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YX-MATCH  instance.  Let’s  first  create  an  alternate  schedule  of  the  YX-MATCH 

instance.  Let  =  5y.  To  create  5^,  for  each  Xj  schedule  it  one  time  unit  after  the  yi 
that  has  a  yi  Xj  arc.  Claim:  S'  =  {S'x,S{r)  is  valid.  Because  of  our  matching  of  Y  X 
edges,  no  two  X  jobs  are  assigned  the  same  execution  time.  Each  xj  is  executed  no  later  for 
S'x  than  it  did  for  Sx,  since  in  S'x  it  is  executed  as  soon  as  possible,  given  Sy.  Since  X  jobs 
are  moved  earlier,  but  Y  jobs  are  unmoved,  all  X  Y  constraints  remain  satisfied.  Our 
specification  of  S'x  also  satisfies  all  Y  — ♦  X  constraints.  So  S'  is  valid.  To  get  a  schedule 
for  our  original  instance  of  PIPELINE-2,  simply  make  S"  =  Sy^  The  last  job  of  Sy  must 
be  an  idle  one,  so  we  can  discard  it,  yielding  an  S"  of  length  <  t.  We  just  need  to  verify 
that  if  there  is  a  li  -»  Xj  edge,  these  jobs  wiU  edways  be  separated  by  at  least  one  time  unit. 
Since  the  YX-MATCH  version  had  a  xf  path,  then  x?"‘  and  xj“‘  must  have 

had  an  intervening  job.  | 

PIPELINE-2  can  be  solved  by  a  polynomial  time  algorithm.  The  algorithm  is  based  on 
modifications  to  a  classic  scheduling  problem: 

TWO  PROCESSOR  SCHEDULING 

INSTANCE:  Set  X  of  unit-time  tasks,  partial  order  on  X,  and  deadline  t. 

QUESTION:  Is  there  a  schedule  of  the  n  =  |X|  tasks  that  obeys  the  partial  order,  never 
schedules  more  than  two  tasks  for  the  same  time  slot  and  is  of  length  <  t? 

In  other  words,  solve  the  m-PROCESSOR  SCHEDULING  problem  except  that  m  is 
fixed  at  2.  There  are  a  number  of  solutions  to  this  problem  [CG72,  Gab82].  Leung,  Vorn- 
berger  and  Witthoff  [LVW84]  extend  these  to  handle  the  PIPELINE-2  problem  —  which 
they  refer  to  as  the  “directed  separation  problem”. 

Unfortunately,  it  is  not  clear  how  to  modify  this  algorithm  once  we  start  deleting  the 
y  —*  x  arcs.  This  destroys  an  important  notion  used  in  the  papers — the  notion  of  levels — and 
hence  the  notion  of  a  highest-level  is  destroyed. 

Furthermore,  this  is  about  the  only  case  of  pipeline  scheduling  that  is  polynomial.  Gross 
and  Hennessy  [Gro83,  HG83]  showed  that  the  problem  becomes  iVP-complete  if  we  allow 
the  interlocks  to  be  mixed  between  lengths  of  1  and  2  —  where  an  interlock  is  the  amount 
of  intermediate  time  units  that  must  separate  the  execution  of  the  two  instructions. 
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2.6  Precedence  Constraints 

In  summary,  the  following  are  all  iVP-complete: 

•  Xj  yj,  Xi  -*  Xj,  Vi  Vj,  Vi  Xj. 

•  ®t  Vj,  Vi 

•  li  — >  yj,  at  most  2  yi  —*  Xj  per  yi. 

•  Xi  —*  yj,  sX  most  1  yi  —*  Xj  per  yi,  at  most  1  — ♦  Xj  per  Xj. 

The  following  are  all  equivalent: 

•  Xi-*  yj. 

•  Xi-*  yj,  Xi  -*  Xj,  yi  -*  yj. 

The  following  is  polynomial: 

•  Xj  — ►  yj,  and  yj  —*  Xj  for  each  t. 

The  following  is  as  yet  unclassified: 

•  -♦  Vj*  at  most  1  yj  -►  Xj  per  yj. 


Chapter  3 


Well-Ordered  Schedules 


The  relationship  of  triangularization  to  other  problems  is  interesting  in  itself.  However, 
let’s  turn  more  directly  to  the  task  of  dealing  with  the  triangularization  problem  proper. 

3.1  Definitions 

We  will  need  some  new  notation  before  we  plunge  into  this.  The  graph  in  Figure  3.1  will 
serve  as  an  example. 

Definition  3.1  The  r()  function  returns  the  neighbors  of  a  node  or  set  of  nodes.  So  for 
any  node  z,  let  r(z)  be  the  set  of  nodes  adjacent  to  z.  Let  r(^)  =  \Jz€Z  ^  Figure  3.1, 
we  have  r(x2 )  =  {»!,  yz,  Vz}  and  T{{yi ,  1^2})  =  {®i ,  *2}-  Note  that  we  have  stopped  drawing 
the  edge  directions  in  the  drawings;  in  any  figure,  the  black  nodes  must  be  executed  before 
the  adjacent  white  nodes  are  released. 

Definition  3.2  For  a  set  of  nodes  Z,  we  will  sometimes  want  to  know:  if  we  remove  the 
nodes  in  Z  from  the  graph,  what  nodes  become  isolated — that  is  degree  =  0.  This  is  our 
function  T'{Z)  =  {z  \  ?(«)  C  Z}.  Alternatively  r'(^)  =  T{Z). 

So  in  Figure  3.1,  we  have  r'(®2)  =  {j/s}  and  =  {*17*5}* 

Definition  3.3  For  a  schedule  S  of  the  X  nodes,  we  will  use  5*  (0  <  A:  <  m)  to  denote 
the  subschedule  consisting  of  the  first  k  elements.  So,  if  5"  =  [zi,X27 .  •  •7Zm]7  then  Sk  = 

[ll,l2, .  .  .,Xfc]. 
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Figure  3.1;  Example  of  constraints 


We  will  use  T*.  to  indicate  the  set  of  Y  nodes  that  would  be  free  to  execute  if  all  the 
jobs  in  Sk  were  executed.  That  is,  Tk  =  T'{Sk)  for  0  <  k  <  m.  So,  if  in  Figure  3.1 
5  =  then  T3  =  {yi,y2,y3,y4}- 

ATfc  will  be  the  set  of  Y  jobs  released  for  execution  in  going  from  Sk-i  to  5*..  That 
is,  let  ATfc  =  Tfc  \  Tk-i  for  1  <  ib  <  m;  ATo  =  To.  So,  continuing  our  example,  if 
S  =  [a:i,®2,®3,X4,®B]i  then  AT2  =  {yi,y2,y3}- 

As  mentioned  in  Section  2.1.1  a  schedule  for  px  given  by  Sx  implies  a  natural  schedule 
Sy  for  PY .  Two  views  of  this  schedule  were  given.  A  third  way  to  think  of  this  schedule  is 
even  simpler:  [ATo,  ATi,  AT2, . .  - ,  AT™]. 

3.1.1  Charts 

When  a  Y  job  isn’t  available  yet,  then  py  must  just  wait  for  one.  How  long  must  it  wait? 

Definition  3.4  Let  n*  =  IT*]  -  \Sk\,  0  <  fc  <  m  and  n/t  =  \Tk\  -  |5ife+i|,  0  <  fc  <  m.  Let 
dk  =  inin*_o  n»  and  dk  =  nunf_o  n*,. 

The  total  time  that  py  must  wait  is  -dm-i-  So  we  will  call  -dm-i  the  delay  of  5, 
sometimes  written  delay(5).  If  dm-i  is  positive,  then  py  could  have  started  executing 
dm-i  time  units  before  px  did  and  will  not  have  to  execute  an  idle  job  until  all  Y  jobs  are 
exhausted. 
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no  no  ni  ni  r\2  na  na  ns  n4  n4  ns 


Figure  3.2:  Chart  corresponding  to  Figure  3.1 

For  unit  time  jobs,  n*  =  n*.  —  1  and  thus  dm-i  =  dm-i  ~  !•  When  we  generalize  these 
definitions  for  non- unit  time  jobs,  this  will  not  necessarily  be  true. 

If  there  are  no  isolated  X  nodes  in  the  constraint  graph,  then  ^  ^im-i  thus 
dm  =  dm-i-  Hence  if  we  are  dealing  with  only  unit  time  jobs  and  there  are  no  isolated  X 
nodes  in  the  constraint  graph,  then  dm  =  dm-i  +  1-  Thus,  sometimes  when  we  really  want 
to  show  that  an  operation  doesn’t  change  the  delay  of  a  graph,  instead  of  showing  that  dm-i 
doesn’t  change,  we  may  show  that  dm  doesn’t  change.  Since  we  can  always  throw  away 
all  isolated  X  nodes  and  put  them  at  the  end  of  any  schedule  that  we  ultimately  find,  the 
simplification  of  using  dm  doesn’t  cause  any  problems  unless  we  are  dealing  with  lemmas 
about  square  matrices. 

For  any  schedule  S  of  the  X  nodes,  we  will  draw  a  chart  of  the  sequence  differences 
[no,no,ni, . .  .,nm-i,nm].  Figure  3.2  illustrates  the  chart  for  our  example  from  Figure  3.1 
with  the  schedule  5  =  [xi, . . . ,  *5].  We  can  read  the  delay  of  the  schedule  right  off  the  chart. 
The  delay  is  the  negation  of  the  lowest  value  that  the  chart  dips  to.  In  this  case  the  chart 
dips  to  —2,  so  the  delay  is  equal  to  2. 

Before  we  start  it  is  important  to  note:  Since  the  delay(5)  is  determined  by  the  worst  dip 
in  the  chart  (— dm-i)>  and  since  local  modifications  to  5  result  in  only  local  modifications 
to  the  chart,  then  all  we  need  to  do  is  to  make  sure  that  our  modification  doesn’t  worsen 
the  dip  of  the  chart  in  the  neighborhood  of  the  modification  and  that  will  be  enough  to 
ensure  that  we  haven’t  worsened  the  delay.  To  be  more  specific,  partition  a  schedule  into 
three  parts  5  =  [Xi,X2,X^.  If  all  we  do  is  modify  the  order  of  nodes  within  X2,  yielding 
S'  =  [Xi,X2,X3],  then  the  chart  for  Xi  clearly  remains  the  same,  since  Xi  occurs  before 
X2.  Since  no  modifications  to  the  order  of  the  jobs  within  X2  can  allow  X2  to  relase  a  job 
previously  released  by  X3,  and  since  at  the  end  of  Xj  we  will  have  released  exactly  the  same 
jobs  as  released  at  the  end  of  X2,  then  the  chart  for  X3  remains  unchanged. 
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3.2  Two  Is  Per  Row 

Let’s  start  with  a  special  case.  Let’s  consider  the  case  where^our  matrix  has  no  more  than 
two  Is  per  row.  That  is,  each  Y  node  has  degree  at  most  2,  as  in  Figure  3.1. 

3.2.1  Component  Types 

The  first  step  in  understanding  the  case  with  two  Is  per  row  is  to  classify  the  possible 
connected  subgraphs  of  this  class  of  graphs  into  7  types.  Each  type  will  have  a  characteristic 
chart  as  shown  in  Figure  3.3. 

Typ®  (■"!)  ■”!)•  isolated  X  node.  Looking  at  the  chart  for  it  gives  an  explanation  of 
its  name.  Its  worst  (lowest)  dip  is  —1  and  its  final  value  is  —1. 

Type  (0, 1):  an  isolated  Y  node.  Since  there  are  no  X  nodes,  the  chart  instantly  reaches 
its  final  value  of  1.  It  never  dips,  which  we  will  choose  to  define  as  0;  we  will  see  that  this 
choice  works  best,  but  there  is  no  choice  that  really  suffices  in  all  that  follows.  Fortunately, 
these  nodes  and  the  (  —  1,  —1)8  can  be  and  will  be  discarded  early. 

Type  (-1,0):  these  are  the  connected  components  with  the  same  number  of  X  nodes 
and  Y  nodes  and  with  at  least  one  Y  node  having  degree  =  1.  It  follows  that  there  is 
exactly  one  such  Y  node.  For  this  type  of  component,  there  is  always  a  way  to  remove  an 
X  node  such  that  it  releases  a  Y  node  and  leaves  behind  only  (-1,0)  subpieces.  Hence 
the  optimum  chart  will  be  a  saw-tooth,  bouncing  back  and  forth  between  0  and  —1,  ending 
at  0. 

Type  (-1,-Hifc),  where  A:  >  0:  these  are  the  components  with  A:  >  0  more  Y  nodes  than 
X  nodes  and  with  at  least  one  Y  node  having  degree  =  1.  For  this  type  of  component, 
there  is  always  a  way  to  remove  an  X  node  such  that  it  releases  a  Y  node  and  leaves  behind 
only  subpieces  of  types  (0, 1)  and  (—1,/),  where  0  <  I  <  k. 

Type  (-2, 0):  these  are  the  components  with  the  same  number  of  X  nodes  and  Y  nodes, 
and  such  that  every  Y  node  has  degree  =  2.  Clearly  two  X  nodes  must  be  removed  before 
a  Y  node  can  be  released.  There  is  always  a  choice  of  two  X  nodes  whose  removal  releases 
one  Y  node.  If  the  component  has  more  than  two  X  nodes,  this  leaves  behind  a  (  1,-|-1) 
piece  and  possibly  some  (-1,0)  pieces. 

Type  (-2,  +k),  where  A:  >  0:  these  are  the  components  with  A;  >  0  more  Y  nodes  than 
X  nodes  and  with  all  Y  nodes  having  degree  =  2.  Clearly  two  X  nodes  must  be  removed 
before  a  Y  node  can  be  released.  There  is  always  a  choice  of  two  X  nodes  whose  removal 
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releases  one  Y  node.  Other  than  the  just  released  Y  nodes,  the  remaining  graph  will  consist 
of  pieces  of  types  (—1, 0)  and  (—1)  0>  where  0  <  /  <  A:  +  1. 

Xype  (-2,-1):  these  are  the  components  which  have  one  less  Y  node  than  X  nodes, 
yet  all  Y  nodes  have  degree  =  2.  That  is,  they  are  trees  whose  leaves  are  all  X  nodes. 
Removing  any  leaf  turns  it  into  a  (—1,0)  piece. 

3.2.2  Ordering  the  Components 

The  intuition  for  iinding  an  optimum  schedule  is  that  we  want  to  quickly  force  the  chart 
values  to  get  high  enough  so  that  any  subsequent  dips  in  the  chart  won’t  dip  down  very  far. 

It  turns  out  that  it  is  always  best  to  schedule  the  components  in  a  particular  order  based 
on  their  types:  (0,1),  (-1,+A:),  {-2,+k),  (-1,0),  (-2,0),  (-2,-1),  (-1,-1). 

We  already  know  why  it  is  best  to  put  (0,  l)s  first  and  the  (  — 1,  — l)s  last — dominance. 
But  how  can  we  see  the  rest  of  this? 

Definition  8.6  The  borrowing  of  a  subschedule  Sk  is  like  the  delay  of  a  schedule,  except 
that  we  don’t  negate  it,  that  is,  it  equals  dk-\.  The  profit  of  a  subschedule  Sk  is  simply 
|Tfc|  -  |5fc|  =  Tife.  Thus  the  type  (6,p)  of  a  component  reflects  its  borrowing  b  and  profit  p. 

Choosing  to  put  the  (-1,+A:)s  next  after  the  (0,  l)s  is  an  example  of  what  we  call 
“greedy  use  of  minimum-borrowing  non- negative  profit  subschedules” .  This  terminology 
may  cause  some  confusion;  the  more  that  is  borrowed,  the  more  negative  the  borrowing. 
So  the  TniniTmim  borrowing  is  actually  the  borrowing  with  the  largest  numerical  value. 

For  the  moment  we  treat  each  connected  component  as  an  indivisible  piece.  In  the 
general  case  this  is  not  allowable  and  we  will  show  later  why  this  is  valid  for  our  current 
case  of  two  Is  per  row.  First  let’s  show  why  our  schedule  is  optimal  under  this  assumption. 

We  will  come  up  with  an  ordering  <  on  pieces  of  the  type  t  =  {b,p)  where  b  is  the 
amount  of  borrowing  we  did,  and  p  is  the  profit  that  we  realized. 

Assume  that  we  have  two  tuples  ti  =  {bi,pi),  <2  =  (^ZiPz)-  The  <d-ordering  is  as  follows. 

•  If  Pi  and  p2  are  positive,  we  have  these  cases:  bi  <  62  ^  t>  tj,  =  ^2  ^  —  ^2) 

and  61  >  62  <i  h-  \ 

j 

•  If  Pi  and  P2  are  negative,  we  have  these  cases:  61  —  pi  <  62  -  P2  =r‘  <1  <d  <2)  -  Pi  = 

62  —  P2  <1  =  ^2j  and  61  —  Pi  >  62  —  P2  ^1  t>  t2- 


•  If  Pi  is  positive  and  p2  is  nonpositive  or  if  pi  =  0  and  p2  is  negative,  then  ti  <1  <2- 
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•  If  Pi  is  nonpositive  and  p2  is  positive  or  if  pi  is  negative  and  pa  —  0)  then  <i  t>  tj. 

•  If  Pi  =  Pa  =  0>  then  ti  =  ta- 

One  property  of  the  O -ordering  that  we  can  immediately  observe  is: 

Lemma  3.1  (6i,Pi)  <3  (i>2)P2)  (h)P2}  >  (htPi)-  I 

Lemma  3.2  Given  a  schedule  with  h  =  (fei.Pi)  directly  following  ta  =  (6a, P2)-  If  h  <  h 
or  <1  =  ta,  then  it  is  never  worse  to  swap  the  order  and  schedule  ti  directly  before  ta- 

Proof;  The  swapping  operation  has  no  global  effects.  So  all  we  need  to  look  at  is 
the  worst  borrowing  for  each  of  the  two  orders  within  their  local  portion  of  the  schedule. 
Without  swapping,  that  is  if  ta  i®  before  ti,  then  the  worst  borrowing  is  6  =  min(6a,  6i  -|- pa). 
If  we  swap  so  that  <i  is  before  ta,  then  the  worst  borrowing  is  h'  =  min(6i,6a  +  pi).  So  we 
must  show  that  if  t\  <3  <a  then  b'  >  b. 

Let’s  follow  the  case  structure  of  the  definition  of  <J. 

•  Pi  and  Pa  are  positive.  So  we  know  that  6i  >  63.  Hence,  61  -f  pa  >  6a  and  6  =  63.  So, 
b*  >  b  regardless  of  whether  b'  =  61  or  6'  =  63  -|-  Pi  • 

•  Pi  and  Pa  are  negative.  Since  ti  <1  <3  or  <1  =  ta ,  'we  know  that  63  —  pa  >  61  —  pi .  Hence, 

6a  >  61  -t-  Pa  and  6  =  61  -|-  pa.  So,  6'  >  6  regardless  of  whether  b'  =  61  or  6'  =  63  -H  pi. 

•  Pi  is  positive  and  pa  is  nonpositive  or  pi  =  0  and  pa  is  negative.  This  implies  that 

^1  >  61  -j-  Pa  and  63  -t-  pi  >  63.  Hence,  b'  >  6. 

•  Pi  =  P2  =  0.  This  implies  that  61  =  61  -I-  pa  and  63  -I-  pi  =  63.  Hence,  6'  =  6. 

I 


Lemma  3.3  For  any  set  of  independent  (borrowing,profit)  tuples,  there  is  an  optimal  or¬ 
dering  that  obeys  the  <1 -ordering. 

Proof:  Assume  that  we  have  some  schedule  5  that  is  A:  >  0  inversions  away  from  the 
nearest  <d-^d  ordering.  Clearly  there  is  some  place  where  two  adjacent  (6,p)  tuples  are 
out  of  order.  By  Lemma  3.2  we  can  swap  these  two  tuples  without  degrading  the  schedule. 
The  post  swap  schedule  5'  is  fc  -  1  inversions  away. 
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We  can  clearly  repeat  this  until  we  get  a  *:  =  0  schedule  that  is  at  least  as  good  as  5. 

I 

We  have  previously  stated  that  the  order  should  be:  (0, 1),  {-1,+k),  (-2,+A:),  (-1,0), 
(-2,0),  (-2,-1),  (-1,-1).  Now  we  can  see  that  that  was  a  simplification.  Components 
of  types  (-2,0)  and  (-1,0)  are  equivalent  and  can  be  interchanged  without  degrading  the 

schedule. 

3.2.3  Transposition  of  a  Chart 

With  what  we  have  now  defined,  we  can  take  a  detour  and  look  at  what  happens  to  a  chart 
for  a  matrix  G  when  we  transpose  its  matrix,  yielding  G'.  When  we  are  looking  at  the  chart 
of  G,  the  schedule  of  the  Y  nodes  is  determined  by  the  schedule  of  the  X  nodes;  when  we 
are  looking  at  the  chart  of  G',  the  schedule  of  the  X  nodes  is  determined  by  the  schedule 
of  the  y  nodes.  Hence,  when  looking  at  the  chart  of  the  transpose,  we  aren’t  necessarily 
interested  in  the  reverse  of  the  original  schedule  of  the  X  nodes,  rather  we  are  interested 
in  the  schedule  of  the  X  nodes  that  is  determined  by  the  reverse  of  the  original  schedule  of 
the  y  nodes  in  G.  This  difference  requires  us  to  explore  a  few  new  lemmas. 

Lemma  3.4  For  a  graph  G,  we  are  given  a  schedule  5  =  [®i, . .  .,a:n]  with  profit  p  and 
maximum  borrowing  6.  If  we  transpose  the  graph  yielding  G',  then  the  schedule  S'  = 
[j/n, . •  • , l/i]  will  have  profit  p'  =  -p  and  maximum  borrowing  b'  >b-p. 

Proof;  Since  the  original  profit  is  p  =  |y|  -  |Xl  and  the  profit  of  the  transpose  is 
p'  =  |X|  —  |y|,  we  can  see  that  p  =  — p. 

We  can  think  of  schedule  5'  as  [AT^,  AT^-i, . . . ,  ATo].  Let’s  compute  the  borrowing 
at  any  point  during  the  scheduling  of  some  AT*.  By  this  point  we  could  have  scheduled  at 
most  the  V  nodes  Y  \  We  will  have  released  at  least  the  X  nodes  Xm)  Xm.~i)  •  •  •  ? 

So  we  have; 

n'  >  |X\Si|-|y\T._i| 

=  |Ti_i|-s.-(|y|-|xi) 

=  fbi-p 

Since  b  is  the  Tninimiim  over  all  hi  and  b'  is  the  minimum  over  all  n',  then  V  >b-p.  I 
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no  no  ni  ni  na  na  ^3  na  n4  n4  ns 

Figure  3.4:  Chart  corresponding  to  transpose  of  the  graph  in  Figure  3.1 

Lemma  3.5  For  a  graph  G,  we  are  given  an  optimum  schedule  S  =  [xi,...,a:„]  with 
profit  p  and  maximum  borrowing  b.  If  we  transpose  the  graph  yielding  G\  then  the  schedule 
S'  =  [i/„, . .  .yi]  will  have  profit  p'  =  -p  and  maximum  borrowing  b'  =  b  -  p. 

Proof;  By  Lemma  3.4,  6'  >  b  -  p.  Take  G'  and  transpose  it  again  to  yield  G".  By 

Lemma  3.4,  b"  >  b'  -  p'.  Since  the  original  schedule  5  is  optimal,  b"  =  b.  So  we  have 

b  =  b"  >  b'  -  p'  >  b  -  p  -  p'  =  b.  Hence  b  =  b'  -  p'  and  b'  =  b  -  p.  I 

This  extends  to  individual  (5,p)  pieces,  provided  that  each  X  node  in  the  (5,p)  piece  is 
adjacent  to  some  T  node  in  the  piece.  Hence,  to  get  the  chart  of  the  transpose  of  the  graph 
in  Figure  3.1  we  first  take  each  {b,p)  of  the  original  graph  and  replace  it  with  {b  -  p,  -p) 
and  then  reverse  the  order  of  the  sequence  of  (5,p)  pairs.  So  for  our  example  we  start 
with  {(-2,1),  (-1,0), (-1,-1)]  as  shown  in  Figure  3.2.  This  gets  transformed  first  into 
[(_3,  _i), (_i, 0), (0, 1)1  and  then  into  [(0, 1),  (-1,0), (-3,-1)]. 

Figure  3.4  shows  the  chart  of  the  transpose  of  Figure  3.1  using  the  schedule  Sx  = 
[Ps ,  1/41  Psi  P2>  Pi]-  We  can  see  that  the  chart  for  Figure  3.4  does  precisely  follow  the  behavior 
[(0, 1),  (-1,0),  (-3,  -1)],  as  expected. 

A  few  other  things  to  note.  (1)  Since  (0,  l)s  and  (-1,  -l)s  are  transposes  of  each  other, 
this  (6,p)  -*  {b  —  p,  — p)  transformation  implies  that  we  chose  the  right  name  for  the  (0, 1) 
pieces.  (2)  This  transformation  of  sequences  of  (6,p)  pairs  is  its  own  inverse.  (3)  If  the 
original  sequence  of  (5,p)  pairs  obeys  the  <1  ordering,  then  so  does  the  transformed  one, 
since: 

Lemma  3.6  (6i,Pi)  <1  (&21P2)  4=^  (62 -P2) -P2)  <1  (*>i -Pii -Pi)-  (^’iiPi)  =  (^2)P2)  4=^ 
(62  -  P2,  -P2)  =  (^1  -  Pli  -Pi)-  (^liPl)  (*’2iP2)  4=^  {^2  -  P2,  -P2)  t>  (i»l  -  Pli  -Pi)-  I 


Proof:  Let’s  follow  the  case  structure  of  the  definition  of  <1. 
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•  Pi  and  p2  are  positive.  There  are  three  cases.  Case  1:  (fri,Pi)  <J  (*>2,P2)  <=►  > 

62  <=>  (bi-pi)-(-pi)  >  (h-P2)-(-P2)  <=>  (i»2-P2»-P2)<](^i-Pi)-Pi)-  Case2: 
(^>i>Pi)  =  {h,P2)  <=►  h  =  h  <=>  {h  -  Pi)  -  (-Pi)  =  (^2  -  P2)  -  (-P2)  <=> 
(b2-P2,-P2)  =  (i»i  -  Pi,-Pi)-  Case  3:  (6i,pi)  t>  (62,P2)  <!=►  <  62 

(61  -  pi)  -  (-Pi)  <  (62  -  P2)  -  (-P2)  <=>  (h  -  P2,  -P2)  t>  (bi  -  Pi,  -Pi)- 

•  Pi  and  p2  are  negative.  There  are  three  cases.  Case  1:  (61, Pi)  <1  {b2,P2)  <=> 

61 -Pi  <62 -Pi  <=>  {bi  -  Pi)  <  {b2  -  P2)  {b2  -  P2,-P2)  <  (bi  -  Pi,-Pi)- 

Case  2:  (61, Pi)  =  (62, P2)  <=>•  ^>i  -  Pi  =  <»2  -  P2  <=►  (^1  -  Pi)  =  (^»2  -  P2)  <=> 

(62 -P2, -P2)  =  (^1 -Pi> -Pi)-  3:  (6i,pi)  >  (62,P2)  •<=>■  bi-pi>b2-p2 

(&i  -  Pi)  >  (^2  -  P2)  <=>  (^2  -  P2,  -P2)  >  (^*1  -  Pi,  -Pi)- 

•  Pi  is  positive  and  p2  is  nonpositive  or  pi  =  0  and  P2  is  negative.  This  implies  that 

either  — pi  is  negative  and  — P2  is  nonnegative  or  — pi  =  0  and  — p2  positive.  This  is 
equivalent  to  saying  that  either  —p2  is  positive  and  — pi  is  nonpositive  or  —p2  —  0  and 
— Pi  is  negative.  Hence  both  (fri,pi)  <3  (fr2,P2)  (^2  —  P2»  — P2)  <1  (^1  —  Pi»  — Pi)- 

•  Pi  is  nonpositive  and  p2  is  positive  or  pi  is  negative  and  p2  =  0.  This  implies  that 
either  — pi  is  nonnegative  and  — P2  is  negative  or  — pi  is  positive  and  — P2  =  0.  This 
is  equivalent  to  saying  that  either  -p2  is  nonpositive  and  -pi  is  positive  or  -p2  is 
negative  and  — pi  =  0.  Hence  both  (bi,Pi)  t>(62,P2)and  (62— P2,— P2)  l>(i>i— Pi,  — Pi)- 

•  Pi  =  p2  =  0.  Then  — p2  =  — Pi  =  0.  Hence  both  (61, pi)  =  (i2,P2)  (62  —P2,  — P2)  = 

(fei  -Pi,-Pi)- 

I 


3.2.4  Indivisible  Components? 

Heturning  to  our  proof  for  two  Is  per  row,  we  still  have  to  show  that  we  can  treat  the 
components  as  indivisible.  That  is,  we  have  to  show  that  we  cannot  get  a  better  schedule 
by  allowing  a  schedule  to  start  with  one  component  and  switch  to  a  second  component  and 
then  back  to  the  first. 

This  is  not  true  in  the  general  case,  when  we  have  more  than  two  Is  per  row,  as  Figure  3.5 
shows.  The  reason  that  intermixing  schedules  is  an  improvement  in  Figure  3.5  is  because  a 
positive  point  in  the  chart  for  one  component  preceded  the  chart’s  lowest  point. 
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Because  of  dominance,  for  this  section  we  can  ignore  connected  components  with  only 
one  node. 

The  following  lemma  will  help  us  in  proving  that  we  can  treat  the  components  as  indi¬ 
visible. 

Lemma  3.7  Given  a  connected  bipartite  graph  G  =  (X,  Y,  E)  in  which  the  maximum 
degree  of  the  Y  nodes  is  <  2.  For  every  non-empty  subset  0  C  X'  C  X,  we  have  |r'(X')|  — 
jX'l  <  |r'(Jr)|-l.Y|  =  n-m. 

Proof:  As  this  will  be  a  proof  by  contradiction,  assume  that  we  have  a  set  X'  such 
that  |r'(X')|  -  |X'l  >  |r'(X)|  -  \X\.  Let  Y'  =  r'(X'),  X'>  =  X\  X'  and  Y"  =  Y\  Y'. 

Consider  the  graph  induced  by  the  nodes  X"  U  Y".  It  has  some  number,  c,  of  connected 
components.  Thus  there  must  be  at  least  |X"|  -|-  \Y"\  —  c  edges  between  X"  and  Y".  By 
definition  of  r^(X^),  there  are  no  edges  from  X"  to  Y'  —  Since  our  bipartite  graph  is 

connected,  there  must  be  at  least  c  edges  between  Y"  and  X' .  Thus  there  at  least 
edges  that  have  one  endpoint  in  Y" .  Since  |X”|  =  |Jr|  —  |X'|  >  |r'(A^)j  —  |r'(Jr')|  =  |y”|, 
then  there  are  >  2  •  |y"|  edges  that  have  one  endpoint  in  \Y"\.  But  every  node  in  ly"!  is  of 
degree  <  2.  This  is  a  contradiction.  I 

Corollary  3.8  Given  any  schedule  S  for  a  connected  bipartite  graph  G  =  {X,Y,E)  in 
which  the  maximum  degree  of  the  Y  nodes  is  <  2.  It  follows  that  maxJ^j  Uj  =  I 

Corollary  3.9  Given  any  schedule  5  for  a  connected  bipartite  graph  G  =  {X,Y,E)  in 
which  the  maximum  degree  of  the  Y  nodes  is  <  2.  Either  maxj^o  ^  or  maxjlo  nj  =  tiq. 

I 

Definition  3.6  For  any  specified  schedule,  we  will  refer  to  nj  as  the  balance  at  time  t.  And 
maxjlo  rii  will  be  referred  to  as  the  peak  balance. 

So  for  every  connected  component  (in  the  two  Is  per  row  case),  the  peak  balance  occurs 
at  the  beginning  and/or  the  end  of  the  schedule.  Also  note  for  any  schedule  for  any  of 
the  {b,p)  pieces  we  are  dealing  with,  the  balance  must  dip  to  6  -1-  1  before  the  balance 
becomes  positive- — this  is  a  more  complicated,  and  more  useful,  way  of  saying  that  (—2,  ±fc) 
components  must  schedule  two  X  nodes  before  they  release  any  Y  nodes. 

What  happens  to  the  peak  balance  when  we  transpose  a  component? 
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Lemma  8.10  Given  a  connected  graph  G  such  that  for  any  schedule  S  of  it  maxj^o  ni  =  P 
and  such  that  for  any  schedule  S'  for  G',  the  transpose  of  G,  we  have  that  maxjlo  nj  > 
|X|  -  \Y\  +  P.  Then  given  any  schedde  S'  for  G',  we  have  that  maxj^o  K  =  l-^l  "  1^1  +  P- 

Proof;  Assume  to  the  contrary  that  we  have  a  schedule  S'  for  G'  with  max^o  = 
n'j  >  \X\  -  \Y\  +  P.  That  is,  |r'(5j)|  -  |5;|  >  |X|  -  iy|  +  P.  Then  in  the  original  graph,  if 

we  start  a  schedule  with  X  \  r'(5'),  we  release  Y  \  S'^.  Thus  the  balance  at  this  point  is: 

|r\5;i-|x\r'(5j)|  =  m-|X|  +  (|r'(5;)|-|5j|) 

>  |y|-m  +  (|x|-|y|  +  P) 

=  p 


This  is  a  contradiction.  | 

Corollary  3.11  Given  a  connected  graph  G  such  that  for  any  schedule  5  of  it  maxjlo  = 
Tirn  =  |y|  -  |X|.  Given  any  schedule  S'  for  G',  the  transpose  of  G,  we  have  that  maxj^o  K  = 
no  =  0. 

Corollary  3.12  Given  a  connected  graph  G  such  that  for  any  schedule  5  of  it  maxj^o  nj  = 
no  =  0.  Given  any  schedule  S'  for  G',  the  transpose  of  G,  we  have  that  max^o  n^  =  n„j  = 

1^1 -m- 

Corollary  3.13  If  a  connected  graph  has  its  peak  balance  at  the  end  (beginning)  of  every 
schedule  of  it,  then  the  transposed  version  of  the  graph  has  its  peak  balance  at  the  beginning 
(end)  of  every  schedule  of  it. 

In  talking  about  intermixing  of  multiple  components,  complicated  notation  seems  un¬ 
avoidable.  We  will  have  a  number  of  components  Ci,C2,  •  •  Each  one  will  have  a 

schedule  5^  5*, . . . ,  5*.  Let  n)  =  |r'(5j)i  -  15)1 .  There  wiH  be  a  schedule  S  which  is  an 
intermixing  of  the  schedules  5^,  5*, . . .,  5*.  Let  u{i,j)  =  the  number  of  X  nodes  from  5* 
that  are  scheduled  before  or  during  time  j  in  schedule  5.  Hence,  Uj  = 

Lemma  3.14  Given  k  components  Ci,...,Ck  such  that  (1)  every  component  either  has 
its  peak  balance  at  the  end  of  every  possible  schedule  for  it  or  it  has  its  peak  balance  at  the 
beginning  of  every  possible  schedule  for  it  (2)  at  least  one  component  has  its  peak  balance 
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at  the  end  of  every  possible  schedule  for  it  and  (3)  any  schedule  of  any  (6i,Pi)  component  Ci 
dips  to  a  balance  of  1  before  the  balance  becomes  positivej  given  schedules  S  j . . . ,  S 
for  the  components  and  given  an  intermixed  schedule  5  of  the  components.  There  exists 
a  schedule  5'  that  has  only  /b  -  1  of  the  components  intermixed  and  the  other  component 
scheduled  first,  such  that  S'  has  a  delay  no  worse  than  5. 

Proof!  Without  loss  of  generality,  assume  that  Ci  is  a  minimum  component  with  respect 
to  the  <1  ordering.  Let  S'  be  a  schedule  with  Ci  scheduled  first  and  scheduled  optimally, 
followed  by  Ca, . . . ,  C'fc  intermixed  in  the  same  order  as  they  are  in  5. 

Let  mi  =  the  number  of  X  nodes  in  Ci.  If  the  j-th  X  node  in  5  is  not  part  of 
Cl,  then  in  S'  it  will  be  scheduled  in  position  j  +  mi  -  u(l,y).  In  S',  for  j'  >  mi, 
n'j>  =  Ei=i  =  "mi  +  Ei=2  J  =  J  +  -  "(1.  j)-  We  claim 

that  >  ni(i,j)  and  hence  n'-,  >  n,.  We  can  see  that  since  they 

are  both  the  balance  at  the  end  of  schedules  of  Ci.  Claim:  Since  Ci  is  minimum  with 
respect  to  <1  and  at  least  one  component  of  Ci,. .  .,Ck  has  its  peak  balance  at  the  end  of 
every  possible  schedule  for  it,  then  Ci  is  one  such  component  and  hence 
all  j.  To  see  this,  note  that  for  all  components  that  satisfy  condition  (1)  in  the  statement 
of  the  lemma:  (a)  a  component  has  its  peak  balance  at  the  end  of  every  possible  schedule 
for  it  iff  the  component  has  a  nonnegative  profit  (b)  a  component  has  its  peak  balance  at 
the  beginning  of  every  possible  schedule  for  it  iff  the  component  has  a  nonpositive  profit. 
Hence,  some  component  has  a  nonnegative  profit.  Prom  the  definition  of  <,  it  follows  that 
Cl  has  a  nonnegative  profit.  Hence,  Ci  has  its  peak  balance  at  the  end  of  every  possible 
schedule  for  it. 

Now  all  we  need  to  show  is  that  min^i  n'j  >  min^i  tij.  To  see  this,  let’s  look  at  S. 
In  particular,  let’s  look  at  the  smallest  value  of  I  such  that  =  6^'  +  1  for  some  5* 

and  pii  >  0.  Since  every  component  reaches  a  balance  of  6^  +  1  <  0  before  its  balance 
goes  positive,  then  <  0  for  all  i  ^  i'.  Hence  min!^i  nj  <  nj  =  E?::!  "u(t,J)  — 

Since  we  have  scheduled  Ci  optimally  in  S'  and  since  Ci  is  minimum  with  respect  to  <1 , 
"u(i'  i)  =  bii  +  l<bi  +  l  =  min^’i  n'-  and  hence  min^i  n'-  >  min^i  ny.  | 

Because  of  Corollary  3.13  and  Lemma  3.6,  transposition  gives  us: 

Corollary  3.16  Given  k  components  Ci, . . . , Cfc  such  that  (1)  every  component  either  has 
its  peak  balance  at  the  end  of  every  possible  schedule  for  it  or  it  has  its  peak  balance  at  the 
beginning  of  every  possible  schedule  for  it  (2)  at  least  one  component  has  its  peak  balance 
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at  the  beginning  of  every  possible  schedule  for  it  and  (3)  any  schedule  of  the  transpose  of  any 
(bi,pi)  component  Cj  dips  to  a  balance  oi  bi  —  pi  +  1  before  the  balance  becomes  positive, 
given  schedules  . .  .,5*  for  the  components  and  given  an  intermixed  schedule  S  of  the 
components.  There  exists  a  schedule  S'  that  has  only  A:  -  1  of  the  components  intermixed 
and  the  other  component  scheduled  last,  such  that  S'  has  a  delay  no  worse  than  S.  | 

Lemma  8.16  Given  k  components,  each  with  less  than  two  Is  per  row.  There  is  an  optimal 
schedule  that  does  not  intermix  the  components. 

Proof:  This  is  trivially  true  for  the  base  case  of  the  induction,  when  k=l,  since  there 
is  no  intermixing  possible. 

Otherwise,  assume  that  there  is  an  optimal  schedule  S  that  has  intermixing.  For  A  >  1, 
either  there  exists  a  component  which  has  its  peak  balance  at  the  end  of  every  possible 
schedule  for  it,  or  there  doesn’t.  If  such  a  component  exists,  then  Lemma  3.14  tells  us  that 
we  can  find  a  schedule  S'  that  begins  with  one  component  C  separated  out  from  the  others 
and  has  <  fc  -  1  of  the  components. intermixed  and  such  that  delay(5')  <  delay(5).  Hence 
S'  is  also  optimal.  By  induction  on  G\C ,  we  obtain  an  optimal  schedule  S  oi  G\C  that 
does  not  intermix  the  <  A:  -  1  intermixed  components  of  G  \  C.  We  can  append  S"  to 
the  optimal  schedule  of  C,  yielding  an  optimal  schedule  of  the  whole  graph  that  does  not 
intermix  components. 

If  there  does  not  exists  a  component  which  has  its  peak  balance  at  the  end  of  every 
possible  schedule  for  it,  then  for  every  component,  every  schedule  has  its  peak  balance 
at  the  beginning.  Thus  we  must  have  only  (-2,-1)  components.  Any  schedule  of  the 
transpose  of  a  (—2,  —1)  reaches  a  balance  of  —2  —  (—1)  +  1  =  0  before  the  balance  becomes 
positive.  Thus  all  the  preconditions  of  Corollary  3.15  hold.  Once  again,  this  corollary  gives 
us  a  schedule  S'  that  has  one  less  intermixing  than  S  and  then  we  use  induction  on  the 
remaining  intermixed  portion  of  S'  to  yield  an  optimal  schedule  with  no  intermixing.  I 


3.3  Merging 

We  have  seen  that  when  there  are  only  two  Is  per  row,  we  can  independently  solve  the 
separate  connected  components  and  then  merely  append  the  solutions  in  an  appropriate 
order  to  yield  the  overall  solution.  As  Figure  3.5  showed,  this  does  not  apply  for  the  more 
general  case.  Thus  a  question  arises:  In  the  general  case,  can  we  solve  separate  pieces 
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independently  and  then  perform  some  simple  method  of  combining  the  solutions?  Or  must 
we  view  all  separate  pieces  at  the  same  time,  as  we  try  to  solve  the  problem? 


3.3.1  Peak  Balance 

Our  first  intuitive  observation  is  that  the  <3 -ordering  works  because  it  gets  the  balance  as 
high  as  possible  so  that  subsequent  dips  will  not  go  very  far  negative. 

Let’s  apply  this  intuition  of  goodness  to  a  single  component.  What  is  the  largest  possible 
peak  balance?  Can  it  be  achieved  by  an  optimum  schedule? 

Lemma  3.17  For  a  graph  with  ti  rows  and  m  columns  and  with  a  maximum  matching  M 
of  the  graph,  the  peak  balance  is  at  most  n  -  \M\. 

Proof:  Claim:  for  any  set  Y'  C  y,  \Y'\  -  |r(y')l  <  1^1  -  |M|.  To  see  this:  Consider, 
y'  n  M  and  r(y')  n  M.  Clearly  for  each  distinct  node  in  the  first  there  is  a  distinct 
node  in  the  second  that  is  adjacent  to  it.  That  is,  [y^  fl  M\  <  |F(y  )  n  M\.  Hence, 

|y'l  -  |r(r)l  <  (|i"'l  -  \y'  n  m\)  -  (|r(y')l  -  |r(y')  n  m|)  <  ir|  -  |y'  n  m|  <  |y|  -  1m|. 

Claim:  for  any  set  X'  C  X,  |r'(XOI  -  <  1^"!  -  |M|.  To  see  this:  first  note  that 

TiT'{x'))  c  X'.  Thus,  \r{x>)\  -  |X'|  <  |r'(x')|  -  \nr'{x'))\  <  |y|  -  \m\. 

Since  the  balance  of  a  schedule  5  is  tia,  =  |r'(5fc)|  -  15fc|  and  since  n  =  jy],  we  see  that 
Tifc  <  n  -  |M|  for  any  k  and  any  schedule  5.  | 

The  more  interesting  claim  is  that  for  every  graph,  there  is  a  minimum  delay  schedule 
that  has  a  peak  balance  of  n  —  |M|.  To  show  this  we  need  a  more  complicated  variant  of  our 
“inversion”  strategy  used  in  Lemma  3.3.  The  added  complication  comes  from  the  fact  that 
the  pieces  that  we  will  be  shifting  around  will  no  longer  be  independent.  Instead,  they  wiU 
have  edges  to  common  nodes.  Thus,  once  again,  we  will  need  to  get  more  mathematical  in 
order  to  derive  a  few  useful  lemmas. 

3.3.2  Basic  Lemmas 

For  the  rest  of  this  chapter,  assume  that  all  nodes  in  X  have  at  least  one  neighbor,  since 
any  nodes  in  X  with  no  neighbors  are  optimally  placed  at  the  end  of  the  schedule.  As  noted 
earlier,  this  sometimes  allows  us  to  concentrate  on  dm  instead  of  dm-i  • 

The  first  lemma  tells  us  the  obvious:  if  we  enlarge  (shrink)  the  set  of  nodes  removed 
from  one  side,  we  enlarge  (shrink)  the  set  of  isolated  nodes  on  the  other.  It  also  gives  us  an 
explicit  formula  relating  these  sets  of  isolated  nodes. 
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Lemma  3.18  For  any  two  sets  of  nodes  such  that  Zi  C  Z2,  we  have  T'(Zi)  C  r'(Z2)  and 

r'(Zt)  =  r'(Z2)\r(Z2\Zr). 

Proof:  If  2  e  r'(^i)  then  r(2)  C  Zi  C  Z2.  Hence,  z  G  T'(Z2)  and  r'(Zi)  C  r'(Z2). 

If  2  G  r'(Z2)  \  r(Z2  \  Zi)  then  it  is  not  adjacent  to  any  node  in  Z2  \  Zi.  Since  r(2)  C  Z2, 
then  r(2)  C  Zi  and  2  G  P'W- 

If  2  G  r'(Zi)  then  r(2)  C  Zi.  Since  we’ve  already  shown  that  r'(Zi)  C  ^(^2),  we  have 
2  G  r*(Z2)-  Since  2  is  not  adjacent  to  ^2  \  must  have  2  G  r’(^2)  \  ^(^2  \  ^i)-  Hence, 

r'(Zr)  =  r'(Z2)\r(Z2\Zi).  i 

The  next  three  lemmas  help  us  nail  down  a  specific  formula  for  ATj. 

Lemma  3.19  Given  a  schedule  5  =  [ii,  12,  •  •  • ,  im]  and  a  node  y  G  F ,  we  have  y  G  ATi  if 
and  only  if  i  is  the  largest  integer  such  that  Xi  G  r(y). 

Proof:  Let  i  be  the  largest  integer  such  that  Xi  G  r(y).  Clearly  r(y)  C  Si,  hence 
y  e  r'(5i)  =  Ti.  Since  Xi  G  r(y),  r(y)  g  5i_i.  Thus,  y  i  T'{Si.i)  =  Ti_i.  So  y  G  ATi. 

Since  each  node  y  €  T  is  in  exactly  one  ATi,  the  other  direction  of  the  implication  is 
immediate.  | 

The  function  r'(X')  tells  us  what  Y  jobs  will  be  released  by  the  jobs  in  X'  if  they  are 
the  only  jobs  executed.  What  happens  when  X'  is  intermixed  with  other  X  jobs  in  some 
schedule?  This  causes  the  set  of  Y  jobs  released  by  the  jobs  in  X'  to  vary.  Nevertheless, 
this  set  can  be  bounded  on  both  sides. 

Lemma  3.20  For  any  schedule  5  =  [®i, . .  and  any  set  X'  C  X,  'me  have 

rWc  U  ATi  Q  T{X'). 

xi^X' 

Proof:  Choose  any  y  G  r'(X').  Let  i  be  the  largest  integer  such  that  G  r(y).  By 
Lemma  3.19,  y  G  ATi.  Since  r(y)  C  X',  we  have  Xi  G  X'  and  hence  y  G  ATi  C 

Choose  any  y  G  ATi,  with  Xi  G  X'.  By  Lemma  3.19,  G  r(y).  Hence  y  G  r(xi)  C 
T{X').  I 

Lemma  3.21  For  any  schedule  5  =  [ii,...,Xm]>  we  have  ATi  =  r'(5i) nr(®t). 
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Proof:  Since  Ti  =  r'(5.),  ATi  C  r'(5.).  By  Lemma  3.20,  ATi  C  r(®i).  Hence, 
ATiCT'(Si)nr(x^). 

If  Vi  €  r'(Si)  then  yi  €  Ti.  If  y,  €  r(xi)  then  yi  i  r._i.  Hence,  yi  €  ATi  and 

ATiDT'{Si)mxi).  I 

Lemma  S.22  For  any  two  sets  of  nodes  such  that  Xi  C  X2  and  any  schedule  5  = 
[H,  12, . . . ,  ®m]  of  X2,  we  have 

r'(Xi)  c  r'(X2)  \  U  ^  r'(X2)  \  r'(X2  \  x^). 

xiGXi\Xv 

Proof:  By  Lemma  3.18,  r'(Xi)  C  r'(X2)\r(X2\Xi).  By  Lemma  3.20,  Ux.-gX,\X,  ^ 
r(X2  \  Xi).  Hence,  r'(Xi)  C  r'(X2)  \  U*.ex,\Xi  Lemma  3.20,  T'{X2  \  Xi)  C 

Ux,€X,\x,  ATi.  Hence,  r'(X2)  \  Ux^ex^x,  ^T.  C  r'(X2)  \  r'(X2  \  X^).  I 

3.3.3  Segregation 

To  go  further  we  also  need  a  new  tool.  Our  new  tool  is  caDed  segregation.  Prom  a  schedule  5, 
say  Sx  =  [®i,®3,®6,*4.®6,®2].  we  will  modify  it  to  form  a  segregated  schedule  S'.  The 
way  this  works  is  this:  First,  we  assign  each  node  Xi  a  label  from  a  set  of  labels;  for 
this  example,  let  our  label  set  be  {one, two, three}  and  let  ^(ii)  =  one,  L(®2)  =  one, 
L{x3)  =  two,  L{x4)  =  three,  L(®s)  =  three,  L{xs)  =  two.  The  set  of  labellings  has  an 
ordering,  say  [one,  two, three]. 

To  form  the  segregated  schedule  we  first  use  aU  the  nodes  of  the  first  label,  then  aU  the 
nodes  of  the  next  label,  and  so  on,  until  all  the  nodes  are  scheduled.  All  the  nodes  of  a 
particular  label  will  remain  in  the  relative  order  that  they  have  in  the  schedule  S.  So  in 

our  example,  the  result  will  be  S'^  =  [®i,®2.®3*®6i®5i®4]- 

Like  our  use  of  inversion  in  the  two  Is  per  row  case,  we  will  perform  these  segregations 
in  ways  that  won’t  degrade  the  schedule.  So,  sometimes  we  wish  to  impose  two  conditions: 

•  The  segregated  schedule  always  has  a  delay  no  larger  than  the  original  schedule. 

•  The  labels  can  be  assigned  efficiently  without  knowing  an  optimal  schedule. 

Then  we  can  use  the  labels  to  divide  our  problem  into  smaller  scheduling  subproblems,  one 
per  label.  Optimal  solutions  to  the  subproblems  can  then  just  be  concatenated  to  yield  an 
optimal  solution  to  our  original  problem. 
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Let  5  =  [xi,  be  a  schedule.  Applying  segregation  to  it  wiU  result  in  some  other 

schedule  S'  =  [xi, . .  A  specific  segregation  of  a  specific  schedule  5  can  be  viewed  as 

a  permutation  tt;  that  is,  Xi  = 

Lemma  3.23  If  Si  C  5;^^)  then  ATj  C  If  5<  D  5;(-)  then  Ar<  D  ^K(iy 

Proof:  Since  r(xj)  =  r(xj^(,))  and  since  Lemma  3.18  tells  us  that  5,-  C  implies 
r'(5i)  c  r(5;(.)),  we  know  that  r'(5i)nr(®i)  C  T'{S'^^i))nnKii))-  By  Lemma  3.21,  this 
tells  us  that  AT^  C  AT^^-y  I 

We  can  always  think  of  more  complex  segregations  as  a  composition  of  two-label  seg¬ 
regations,  that  is  segregations  with  |L|  =  2.  Let  that  label  set  be  L  =  {one,  two};  we  will 
use  the  segregation  [one, two].  Lemma  3.23  teUs  us  that  for  two-label  segregations,  if  x, 
has  label  two  then  ATi  C  AT^^^^  and  if  Xj  has  label  one  then  AT,  D  ^K^i)-  The  following 
three  lemmas  give  us  a  set  of  conditions  when  we  can  be  sure  that  applying  a  two-label 
segregation  doesn’t  increase  the  delay. 

Lemma  3.24  Let  /  be  the  number  of  X  nodes  labelled  one.  If  applying  a  two-label 
segregation  with  the  order  [one,  two]  to  a  schedule  5  yields  a  segregated  schedule  S'  with 
n'f  =  n'i,  then  for  all  Xj  with  label  two. 

Proof:  Consider  any  Xj  node  labeled  two.  Clearly,  Si  =  \  (5}  \  where  Xj 

is  the  one  node  most  recently  preceding  Xj  in  the  schedule  5.  (If  Xj  is  non-existent,  then 
using  ir{j)  =  0  makes  the  foUowing  reasoning  still  applicable.)  Since  Si  C  S'^^y  applying 
Lemma  3.22  with  schedule  S'  tells  us  that 

Ti  =  TiSi) 

=  r'(s;,i,)\(r'(s;)\r'(5;,,,)) 

=  r;,,.)  \  (t;  \  ri,,,). 

Hence, 

ni  =  m|-|5i| 

<  ir;,,)  \  (r;  \  -  isiw  \  {S',  \ 

=  (iiiioi  -  n + in,,)!)  -  (i^;(oi  -  is/i + i^i(i)i) 


3.3.  MERGING 


53 


"»(i)  -  Tl/  +  nlr(i) 

<(0- 

I 

Lemma  3.26  Let  /  be  the  number  of  X  nodes  labelled  one.  If  applying  a  two-label 
segregation  with  the  order  [one,  two]  to  a  schedule  5  yields  a  segregated  schedule  S'  with 
ny  =  maxj^y  n'-,  then  >  Ui  for  all  i,  with  label  one. 

Proof:  Consider  any  Xi  node  labeled  one.  Clearly,  5,-  =  \  {S'f  \  ’’'^here  Xj 

is  the  two  node  most  recently  preceding  Xi  in  the  schedule  S.  (If  Xj  is  non-existent,  then 
Si  =  and  Ti  =  and  hence  m  =  n'^^^y)  Since  Si  C  S'^^jy  applying  Lemma  3.22 

with  schedule  S'  tells  us  that  Ti  C  \  (7/  \  Hence, 

n;  =  \Ti\-\Si\ 

<  Ku)\i^\K(i))\-\Ku)\iSf\K(i))\ 

=  (|T;(y)i  -  |r;i  -f-  -  (|5;(y)|  -  \s'f\  -f-  \s'^y^\) 

=  "w(i)  -  "/  +  ”»(i) 

<  <(0- 

I 

Lemma  3.26  Let  /  be  the  number  of  X  nodes  labelled  one.  If  applying  a  two-label 
segregation  with  the  order  [one,  two]  to  a  schedule  5  yields  a  segregated  schedule  S'  with 
n'f  =  maxjlo^i.  ^ 

Proof:  Lemmas  3.24  and  3.25  both  apply  and  hence  we  know  that  Ui  <  <(.)  for  all  t. 
It  is  thus  clear  that  dm  ^  I 

We  will  explore  some  of  the  power  of  this  lemma  later.  A  simpler,  but  still  useful 
segregation  lemma  is: 

Lemma  3.27  Applying  a  two-label  segregation  to  a  schedule  5  that  assigns  the  label  one 
to  a  single  node  Xk  E  X  such  that  r(®jk)  3  Tk  and  the  label  two  to  all  the  other  nodes  in 
X  results  in  a  schedule  with  d'm^  dm- 
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Proof:  If  Tk  =  0,  dk  =  -k  =  d'^.  If  T*  7^  0,  then  Tk-i  =  0  by  hypothesis,  hence 
dk  =  ^  —  k  and  >  1  —  A.  Hence,  >  d*..  For  all  *  >  A,  ®t  =  ®ir(t)’  Hence,  Si  — 

Ti  =  and  rii  =  n[.  Thus  d'^>  d^-  I 

Lemma  3.27  helps  us  reaffirm  something  that  we  already  knew  because  of  dominance: 

Corollary  3.28  Applying  to  a  schedule  5  a  two-label  segregation  that  assigns  the  label 
one  to  a  set  of  nodes  x*  €  X  such  that  r(xfc)  =  Y  and  the  label  two  to  all  the  other  nodes 
in  X  results  in  a  schedule  with  d!^>  dm-  I 

Also,  Lemma  3.27  helps  us  to  validate  the  legitimacy  of  making  certain  local  modifica¬ 
tions  to  schedules. 


Lemma  3.29  Given  a  schedule  5  =  [xi, . .  .,im]'  If  ^Ti+i  —  ATj+a  —  ...  —  ATfc_i  —  0 
and  ATfc  ^  0,  then  the  schedule 

S  —  (xi,  .  .  . ,  Xj,  Xkj  Xt.|_i,  .  .  . ,  Xk—1 ,  X^.^1 ,  .  .  . ,  Xm^ 


will  have  d'^>  dm - 

Proof:  Apply  Lemma  3.27  to  the  graph  G  \  (5i  U  Ti)  and  the  schedule  S\Si.  I 

There  is  another  use  for  segregations.  Sometimes  they  allow  us  to  find  an  optimum 
schedule  for  a  subgraph,  given  an  optimum  schedule  for  the  whole  graph. 

Lemma  3.30  Let  /  be  the  number  of  X  nodes  labelled  one.  Let  5  be  an  optimal  schedule, 
and  let  the  two-label  segregation  yield  a  schedule  S'.  Let  S'f  =  H  dj„_i  = 

dro.i  and  >  din-u  “  optimal  schedule  for  the  graph  G'  induced  by  the 

nodes  (5^  \  5^)  \  ^y)- 

Proof:  Assume  that  S'f  is  not  optimal  for  G' -  This  means  that  there  is  a  schedule  S'} 
that  is  better.  Then  the  schedule  [5},  B'j\  would  be  a  better  schedule  than  S'  and  5.  This 
contradicts  our  assumptions  on  5  and  thus  S'f  is  optimal  for  G'.  I 
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3.3.4  Decompose 

Let’s  show  a  method  to  assign  labels  from  the  label  set  L  =  {ascent,  plateau,  descent} 
to  the  Xi  nodes  without  knowing  an  optimum  schedule,  and  in  such  a  way  that  given  any 
optimal  schedule,  the  corresponding  segregated  schedule  will  not  have  a  larger  delay.  For 
this  purpose  we  assume  that  a  maximum  matching  of  the  graph  G{X,Y')  has  been  given. 

Definition  3.7  An  alternating  path  is  a  path  that  alternates  between  edges  in  a  matching 
and  edges  not  in  the  matching.  That  is,  it  never  consecutively  uses  two  edges  not  in  the 
matching. 

To  simplify  talking  about  this  decomposition,  we  will  modify  the  bipartite  graph.  The 
edges  are  normally  all  oriented  from  JT  to  3^.  Whenever  an  edge  is  in  the  matching,  we  will 
reverse  the  orientation  so  that  the  edge  will  point  from  y  to  X.  Thus  alternating  paths 
now  correspond  to  directed  paths  in  the  modified  graph. 

First  find  any  maximum  matching,  M,  of  the  bipartite  graph.  There  are  |X|  —  \M\ 
unmatched  XjS  and  |y|  -  |M|  unmatched  j/iS.  The  ascent  is  composed  of  the  |y|  -  \M\ 
unmatched  j/jS  and  any  nodes  that  can  reach  them  via  directed  paths  in  the  modified  graph. 
The  descent  is  composed  of  the  |X|  -  \M\  unmatched  ijS  and  any  nodes  reachable  from 
them  via  directed  paths  in  the  modified  graph.  The  plateau  is  composed  of  any  nodes  that 
aren’t  reachable  from  and  can’t  reach  to  any  unmatched  node  via  a  directed  path  in  the 
modified  graph.  (Note  tha;t  there  cannot  be  a  directed  path  between  unmatched  vertices, 
since  M  is  maximum.) 

This  seems  like  a  non-deterministic  labeling,  since  we  say  “find  any  maximum  match¬ 
ing.”  Actually,  the  labeling  is  unique. 

Lemma  3.31  The  node  labelings  are  invariant  under  different  maximum  matchings. 

Proof:  By  inverting  a  path,  we  mean  changing  the  matching  edges  on  it  into  non¬ 
matching  edges  and  vice-versa.  In  our  modified  graph,  this  corresponds  to  changing  the 
orientations  of  every  edge  on  the  path.  It’s  a  well  known  result  in  graph  theory  that  any 
maximum  matching  can  be  converted  into  any  other  by  a  sequence  of  operations  of  the 
form:  either  invert  an  even  length  alternating  path  starting  or  ending  at  an  unmatched 
node,  or  invert  an  alternating  cycle. 
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If  W6  invert  a.  directed  cycle  we  haven’t  changed  the  set  of  unmatched  nodes,  nor  have 
we  changed  the  reachability  from  any  node  to  any  other  node.  So  clearly  the  labelings  don’t 
change. 

If  we  invert  a  directed  even  length  path,  then  we  do  change  the  set  of  unmatched  nodes. 
There  are  two  mutually  exclusive  cases:  either  the  first  node  of  our  directed  path  is  an 
unmatched  JC  node  or  the  last  node  of  our  directed  path  is  an  unmatched  Y  node. 

Say  that  the  first  node  is  an  unmatched  X  node.  Then  after  inversion,  this  previously 
unmatched  node  will  become  matched  and  we  will  instead  have  a  new  unmatched  X  node  at 
the  end  of  the  original  directed  path.  Thus  this  new  unmatched  X  node  is  at  the  beginning 
of  the  inverted  directed  path  and  all  nodes  on  the  path  are  now  reachable  from  this  new 
unmatched  X  node.  Hence  the  labeling  will  remain  the  same. 

Similarly,  if  there  originally  was  an  unmatched  Y  node  at  the  end  of  the  original  directed 
path,  then  after  inversion  there  will  be  an  unmatched  Y  node  at  the  end  of  the  inverted 
directed  path,  and  hence  the  labeling  will  remain  unchanged.  | 


3.3.5  APD-Segregated  Schedule 

Let  5  =  be  an  optimum  schedule.  Segregate  the  schedule  in  the  order: 

[ascent, plateau,  descent].  We  will  call  such  orderings  AP Desegregated  schedulings. 

The  following  lemma,  while  almost  equivalent  to  Lemma  3.17,  is  more  constructive. 

Lemma  3.32  Given  a  bipartite  graph  G  =  (X, y,  JE?)  with  a  maximum  matching  M,  we 
have  rnlnx^cxm  -  |rWI)  =  \M\  -  irj. 

Proof:  Let  V  be  the  set  of  unmatched  nodes  of  Y  plus  all  nodes  from  which  they  can 
be  reached  by  alternating  paths.  Let  X^  =  V  f]X.  Then  r^(X^)  consists  of  the  unmatched 
nodes  of  Y  plus  the  nodes  matched  with  X';  hence  |X'|  —  |r'(X')|  =  \M\  —  |y|. 

If  X*  is  any  subset  of  X,  if  y  G  y  is  in  r^(X^)  and  if  (®,y)  €  M ,  then  ®  G  X  ,  Hence 
|r'(X')|  IX'I  <  |y|  -  |M|.  So,  |M|  -  |y|  is  the  smallest  possible  value.  I 


Lemma  3.33  For  any  schedule  5  =  the  APD-segregated  schedule  5'  = 

[xj , . . . ,  x[^]  corresponding  to  S  has  >  dm- 
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Proof;  First,  let  us  add  the  label  p-or-d  to  every  node  with  the  label  plateau  or 
descent.  Let  us  show  that  for  any  schedule,  the  segregated  schedule  using  the  order 
[ascent,  p-or-d]  is  at  least  as  efficient  as  the  original  schedule  S. 

Let  /  be  the  number  of  X  nodes  labeled  with  ascent.  By  Lemma  3.32,  maXj_onj  = 
niax;i(,(|r'(5-)l  -  ^  1^1  “  l-^l-  "•/  =  1^1  “  maximum 

is  achieved  at  maxj^o^i  ~  ”/•  Lemma  3.26  tells  us  that  d'^>  dm.. 

Instead,  let  us  add  the  label  a-or-p  to  every  node  with  the  label  ascent  or  plateau. 
Again  we  will  show  that  for  any  given  schedule,  if  we  segregated  schedule  using  the  order 
[a-or-p,  descent]  we  get  a  new  schedule  at  least  as  efficient  as  the  original  schedule  S. 

Let  /  be  the  number  of  X  nodes  labeled  with  a-or-p.  By  Lemma  3.32,  maXj_on.i  = 
niaxt^o(ir'(-5.0l  -  ^  1^1  -  1^1-  Since  n)  =  |y|  -  |M|,  we  can  see  that  the  maximum 

is  achieved  at  max^^o^J  =  Hence  Lemma  3.26  tells  us  that  d'm  ^  dm. 

Since  both  of  the  above  segregations  work  for  any  initial  schedule  S ,  they  can  be  com¬ 
posed.  By  composing  the  two  segregations,  we  see  that  a  [ascent,  plateau,  descent]  seg¬ 
regation  of  a  schedule  S  yields  a  schedule  S'  with  d'm  ^  dm-  I 

By  segregating,  we  have  reduced  our  problem  to  a  few  more  specialized  ones.  If  we  solve 
each  of  them  optimally,  we  will  have  solved  our  original  problem  optimally. 

ASCENT 

INSTANCE:  A  time  bound  t  and  a  bipartite  graph  G  =  (X,y,  E)  such  that  |jr|  <  |y|  and 
such  that  it  has  a  maximum  matching  of  size  |X|  and  such  that  every  node  is  reachable 
from  one  of  the  jyj  —  |X|  unmatched  nodes  by  an  alternating  path. 

QUESTION:  Is  there  a  valid  PLT  schedule  of  length  <  <? 

DESCENT 

INSTANCE:  A  time  bound  t  and  a  bipartite  graph  G  =  (A’,y,  E)  such  that  |y|  <  jXj  and 
such  that  it  has  a  maximum  matching  of  size  |y|  and  such  that  every  node  is  reachable 
from  one  of  the  jX]  -  |y|  unmatched  nodes  by  an  alternating  path. 

QUESTION:  Is  there  a  valid  PLT  schedule  of  length  <  t? 


PLATEAU 
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INSTANCE:  A  time  bound  t  and  a  bipartite  graph  G  =  {X,Y,  E)  such  that  [Xj  =  IVI  and 
such  that  it  has  a  perfect  matching. 

QUESTION:  Is  there  a  valid  PLT  schedule  of  length  <  t? 

The  plateau  can  be  further  segregated.  To  see  this  consider  the  modified  version  of  the 
graph  which  has  the  edges  from  a  maximum  matching  oriented  from  y  to  X.  Decompose  the 
plateau  into  a  DAG  of  strongly-connected  components  (SCCs)  and  topologically  number 
the  components.  Like  our  APD-decomposition,  this  decomposition  also  won’t  vary  with 
different  maximum  matchings.  To  see  this,  we  again  need  to  consider  what  happens  when 
we  invert  paths  and  cycles.  Any  directed  even  length  path  that  starts  at  an  unmatched  X 
node  or  ends  at  an  unmatched  Y  node  cannot  extend  into  the  plateau,  so  inverting  such 
a  path  doesn’t  affect  the  plateau  at  all.  Any  directed  cycle  is  either  wholly  outside  the 
plateau  or  wholly  inside  a  single  strongly-connected  component  of  the  plateau,  and  thus 
inverting  such  a  cycle  won’t  affect  the  DAG/SCC  structure  of  the  plateau. 

If  the  decomposition  gives  k  components,  then  we  can  do  the  following  fc  — 1  segregations: 
for  t  =  1  to  fc  —  1,  label  components  1  through  i  with  one  and  t  -f-  1  through  k  with  two. 
These  segregations  will  satisfy  Lemma  3.26  and  hence  we  have  not  harmed  anything. 

Figure  3.6  shows  a  sample  decomposition  of  a  graph.  Note  that  one  of  the  two  “SCC”s 
of  the  plateau  is  degenerate:  the  plateau  piece  {®i4»yi9}  only  two  nodes  and  one 
edge. 

PLATEAU-SCC 

INSTANCE:  A  time  bound  t  and  a  bipartite  graph  G  =  {X,Y,E)  such  that  |X|  =  |y|  and 
such  that  it  has  a  perfect  matching  and  such  that  for  any  subset  0  C  X  C  X,  |r  (X  )|  — 
|X'|  <  0. 

QUESTION:  Is  there  a  valid  PLT  schedule  of  length  <  <? 

Lemma  3.84  PLATEAU  =  PLATEAU-SCC. 

Proof:  We  have  already  described  how  to  use  segregation  to  decompose  a  single  instance 
of  PLATEAU  into  at  most  a  linear  number  of  instances  of  PLATEAU-SCC  that  be  can 
solved  independently.  Hence  PLATEAU  ■^■p  PLATEAU-SCC. 

Since  the  PLATEAU  problem  is  the  same  as  the  PLATEAU-SCC  problem  except 
that  the  allowable  instances  of  PLATEAU  are  a  superset  of  the  allowable  instance  of 
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Figure  3.6:  APD-decomposition 
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PLATEAU-SCC,  then  PLATEAU-SCC  :<r  PLATEAU.  | 

Lemma  S.85  ASCENT  =  DESCENT. 

Proof:  Because  of  Corollary  2.6  we  can  start  with  an  instance  of  ASCENT,  let  t'  = 
f  +  |y|  _  |X|,  transpose  its  adjacency  matrix  and  solve  the  problem  as  a  DESCENT  instance. 
Similarly,  we  Can  start  with  an  instance  of  DESCENT,  let  t'  =  t  +  |y|  —  |X|,  transpose  its 
adjacency  matrix  and  solve  the  problem  as  an  ASCENT  instance.  | 


Lemma  8.36  ASCENT  :<t  PLATEAU. 

Proof:  Given  an  instance  of  ASCENT.  Add  lyi-IA"]  new  nodes  toX.  Connect  each  one 
of  these  to  each  node  of  T.  Solve  this  instance  of  PLATEAU  yielding  an  optimal  schedule  5. 
Give  the  label  two  to  all  the  original  nodes  of  X  and  give  the  label  one  to  the  others. 
By  Corollary  3.28,  the  segregated  schedule  S'  will  have  >  dm-  Since  5  was  optimal, 
d'^  =  dm  and  hence  d'm-i  =  dm-i-  Clearly,  so  Lemma  3.30  tells  us  that 

5'  =  [x'a+i , .  •  • .  ®m]  *8  an  optimal  schedule  for  the  graph  induced  by  (5^  \  5^)  U(^m  \  ^/)» 
which  is  just  our  original  instance  of  ASCENT.  | 


Lemma  3.37  PLATEAU  :<r  ASCENT. 

Proof:  Given  an  instance  of  PLATEAU.  Add  one  new  node  y^+i  to  Y-  Connect  it 
to  every  node  Xi  €  X.  Any  optimal  ASCENT  solution  is  clearly  an  optimal  PLATEAU 
solution.  I 

Corollary  3.38  PLATEAU  =  ASCENT  =  DESCENT  =  PLT.  I 
3.3.6  Well-Ordered  Optimum 

These  segregations  have  given  us  a  good  start  on  defining  what  we  will  call  well-ordered 
optimum  schedules.  To  actually  define  them,  we  need  to  introduce  more  terminology.  All 
examples  use  the  graph  in  Figure  3.6. 
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Definition  3.8  A  nonnegative-profit  subschedule  (NFS)  is  a  schedule  [ii, . . ®fc],  k  <  m, 
such  that  Tifc  >  0  and  rij  <  0  for  all  0  <  i  <  A.  For  example  in  Figure  3.6,  the  subsched¬ 
ule  [®5,®6.®7.®8j®9)®i0i®n]  is  an  NFS  whose  chart  drops  all  the  way  down  to  -7  before 
bouncing  back  up  to  0;  thus,  this  is  a  (-7,0)  NFS.  A  minimum-borrowing  nonnegative- 
profit  subschedule  (MBNFS)  is  a  NFS  whose  dk-i  is  no  smaller  than  that  of  any  other 
NFS.  The  charts  for  the  NFSs  [®4,  *3,2:2]  and  [®2,®i]  only  drop  to  -2  and  there  are  no 
NFSs  whose  chart  only  drops  to  —1;  thus  these  are  (—2,  -|-2)  and  (—2,  -1-1)  MBNFSs.  A  mtn- 
imal  minimum-borrowing  nonnegative-profit  subschedule  (MMBNFS)  is  a  MBNFS  which 
is  minimal  in  the  sense  that  there  is  no  strict  subset  of  the  X  nodes  of  the  subschedule 
that  can  be  re-ordered  to  yield  a  nonnegative-profit  subschedule  with  the  same  or  better 
borrowing  then  the  original  subschedule.  The  {—2, +2)  MBNFS  [®4>®3,®2]  is  not  minimal, 
since  it  contains  the  (— 2,-fl)  MBNFS  [®2>®3];  the  subschedules  [®2,®3]  and  [®i,®2]  are 
MMBNFSs.  Finally,  a  minimal  NFS  with  borrowing  b  is  an  NFS  that  is  minimal  for  its  bor¬ 
rowing  b,  even  though  it  may  not  be  a  MBNFS.  The  subschedule  [15, xe,  ®7>  ®8)  ®9i  ®io,  ®ii] 
is  a  minimal  NFS  with  borrowing  —7. 


AMMBNPS 

INSTANCE:  A  time  bound  t  and  a  bipartite  graph  G  =  {X,Y,  E)  such  that  |X|  <  |y|  and 
such  that  it  has  a  maximum  matching  of  size  |X|  and  such  that  every  node  is  reachable 
from  one  of  the  lY"!  —  IX]  unmatched  nodes  by  an  alternating  path. 

FIND:  An  arbitrary  MMBNFS. 

It  turns  out  that  MMBNFSs  must  be  connected: 

Lemma  3.39  Let  5  =  [®i, . . . ,  ®fc]  be  a  (6,p)  MMBNFS  for  a  graph  G.  The  graph  induced 
by  the  nodes  S  U  T'{S)  is  a  connected  graph. 

Proof:  Assume  to  the  contrary  that  5ur'(5)  is  made  up  of  fc  >  1  connected  components 
Cl,...,  Ck.  The  schedule  5  can  be  viewed  as  a  merge  of  k  separate  schedule  S^,...,S^  such 
thiat  5*  only  uses  nodes  from  Ci.  We  will  borrow  the  notations  v,{i,j)  and  n*-  that  were 
used  in  the  proof  of  Lemma  3.14. 

Choose  j'  to  be  the  smallest  number  such  that  there  exists  an  i  with  nj,  >  0  and 
u{i,fi)  >  0.  For  any  j  there  is  at  most  one  i  such  that  n‘-  ^  hence  the  t  corresponding 
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to  our  choice  of  j  =  j'  is  unique,  and  we  will  assume  without  loss  of  generality  that  this  i 
is  equal  to  1. 

Since  6  <  and  E<=2  <  0  for  0  <  y  <  j',  then  6  <  Zti  "j  =  +  E?=2  "j  < 

0  <  j  <  j'- 

What  this  means  is  that  the  subschedule  consisting  of  the  first  nodes  of  the 

schedule  S^  is  an  NPS  with  borrowing  >  h.  Thus  5  is  not  minimal  and  we  have  our 
contradiction.  | 


Definition  3.9  We  call  a  schedule  a  well-ordered  schedule  if  it  satisfies  the  four  constraints: 

1.  The  schedule  is  APD-segregated. 

2.  The  ascent  is  a  greedy  sequence  of  MMBNPS — that  is,  we  find  an  arbitrary  MMB- 
NPS,  schedule  it  first,  and  then  iterate  on  the  remaining  portion  of  the  ascent. 

3.  The  plateau  is  segregated  into  SCCs,  each  of  which  is  optimally  scheduled.  The 
SCCs  appear  in  a  valid  topological  order. 

4.  If  one  looks  at  the  transpose  of  the  descent,  it  is  a  greedy  sequence  of  MMBNPS. 

In  defining  a  well-ordered  optimum,  we  have  used  a  greedy  sequence  of  MMBNPS  s. 
Before  we  show  that  all  well-ordered  schedules  are  optimum  schedules,  we  need  some  helping 
lemmas. 

Lemma  3.40  Given  any  minimal  NPS  of  borrowing  b  then  there  is  schedule  with  borrowing 
=  min(6, -overhang(M))  that  begins  with  the  NPS. 

Proof:  Start  with  an  arbitrary  optimum  schedule  5  =  [ii, . .  ..a:™].  As  a  first  step, 
label  the  X  nodes  that  appear  in  the  minimal  NPS  with  one,  label  the  other  X  nodes  with 
a  two  and  then  segregate  in  the  order  [one,  two]. 

Let  /  be  the  number  of  X  nodes  labelled  one.  Since  the  one  nodes  form  a  minimal 
NPS,  nj  <  0  for  0  <  1  <  /.  Also  n'y  =  p  >  0  where  p  is  the  profit  of  the  NPS.  Hence 
max^Lo"i  =  Lemma  3.24  tells  us  that  >  rn  for  all  Xi  with  label  two. 

The  initial  nodes  comprising  the  NPS  may  not  be  optimally  scheduled,  so  we  do  so  now. 
This  has  no  effect  on  for  all  ij  with  label  two. 
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The  worst  borrowing  for  the  schedule  either:  (1)  occurs  during  the  scheduling  of  the 
one  nodes,  and  hence  is  equal  to  6;  (2)  occurs  during  the  scheduling  of  the  two  nodes,  and 
hence  is  equal  to  -  overhang  (Jlf).  | 


Lemma  3.41  Given  any  MMBNPS.  There  is  optimal  schedule  of  the  ascent  that  begins 
with  the  MMBNPS. 

Proof:  Let  6  be  the  borrowing  of  the  MMBNPS.  Claim:  b  >  -overhang(Af ),  and  hence 
Lemma  3.40  gives  us  our  result. 

To  see  that  b  >  -overhang(M),  start  with  any  schedule  5  of  the  ascent.  The  balance  of 
this  schedule  must  go  positive,  so  the  schedule  has  some  non-empty  initial  NPS.  Say  that  this 
NPS  has  a  borrowing  of  6'.  We  know  that  b'  <  b.  Also,  delay(5)  >  -b'  >  -b.  Furthermore, 
overhang(Af)  =  delay(5)  for  any  optimum  schedule  5.  Hence  overhang(M)  >  —b,  or 
equivalently  b  >  — overhang(M).  I 


Corollary  3.42  Greedy  use  of  AMMBNPS  is  optimal.  | 

Corollary  3.43  ASCENT  :<t  AMMBNPS.  I 
3.3.7  Fundamental  Charts 

Two  different  well-ordered  schedules  may  have  different  charts.  They  may  even  have  differ¬ 
ent  sequences  of  (b,p)  pairs  for  their  ASCENTS.  Yet  there  is  a  certain  fundamental  essence 
that  can  be  abstracted  from  their  charts  that  will  be  identical. 

To  find  the  fundamental  chart  of  a  chart  first  break  it  up  into  the  standard  three  pieces, 
the  ASCENT,  the  PLATEAU  and  the  DESCENT.  Modify  them  as  Mows. 

REPLACE:  For  the  ascent,  replace  each  MBNPS  with  a  single  (6,p)  pair.  For  each 
plateau  SCC,  replace  it  with  a  single  (6,p)  pair.  For  the  descent,  replace  each  MB¬ 
NPS  of  the  transpose  with  a  single  {b,p)  pair.  Thus  since  the  upper  chart  in  Figure  3.7 
is  a  [(-2, -HI),  (-1, -1-2),  (-1, -1-1),  (-7, -1-1),  (-2,0),  (-1,0),  (-2,-1),  (-2,0)],  then  after  re- 
placement  we  get  the  chart  in  the  lower  left  corner. 

COMBINE:  For  the  ascent,  combine  adjacent  (6,p)  pairs  as  much  as  possible:  If  the 
pair  (62,^2)  follows  the  pair  (&i,Pi)  and  pi  —  bi  >  —62  then  we  can  combine  the  two  pairs 
to  yield  (61,  pi  +P2)-  Do  the  same  thing  for  the  transpose  of  the  descent.  For  the  plateau, 
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Figure  3.7:  Chart  and  Fundamental  Chart. 


combine  all  pairs  into  a  single  (6,0)  pair,  with  b  being  the  minimum  b  over  the  all  the  pairs 
in  the  plateau.  These  leaves  us  with  [(-2,  +4),  (-7,  +1),  (-2, 0),  (-2,  - 1)]  as  shown  by  the 
chart  in  the  lower  right  corner  of  Figure  3.7.  This  is  the  fundamental  chart  corresponding 
to  the  original  schedule. 

If  we  restrict  our  consideration  to  the  ascent,  the  Fundamental  Chart  will  swing  up  or 
down  only  when  the  original  chart  of  the  schedule  hits  a  new  high  or  a  new  low. 

Lemma  3.44  Any  two  well-ordered  optimum  schedules  for  the  same  graph  will  have  the 
same  fundamental  chart. 

Proof:  Suppose  we  have  two  well-ordered  optimum  schedules  Si  and  52.  Let  a  fragment 
be  a  section  of  the  schedule  corresponding  to  a  single  (b,p)  pair  in  the  fundamental  chart. 
Let  Fi  and  Fi  be  the  first  fragments  in  the  ascents  of  5i  and  52,  respectively.  Since  the 
borrowing  of  a  fragment  equals  the  borrowing  of  its  first  MMBNPS,  the  two  fragments  have 
the  same  borrowings  6i  =  62-  Let  pi  and  p2  be  the  profit  associated  with  fragments  Fi  and 
F2,  respectively.  If  we  can  just  show  that  they  use  the  exact  same  sets  of  X  nodes,  then  it 
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will  be  clear  that  pi  =  P2-  Also,  if  they  use  the  exact  same  sets  of  X  nodes,  they  must  leave 
the  same  remaining  nodes.  Thus  we  can  just  iterate  this  analysis  on  the  remaining  nodes, 
and  hence  the  entire  ascents  have  identical  fundamental  charts. 

Transposition  implies  that  the  descents  of  the  schedules  also  have  identical  fundamental 
charts.  Lastly,  it  is  obvious  that  the  plateaus  have  the  same  fundmental  charts  and  hence 
the  entire  fundamental  charts  are  identical. 

So  all  we  need  to  do  is  show  that  Fi  and  F2  use  the  same  sets  of  X  nodes.  Without  loss 
of  generality,  assume  that  pi  <  P2«  In  '^2)  label  all  X  nodes  used  by  Fi  with  the  label  one 
and  label  all  other  X  nodes  with  the  label  two.  Segregate  52  in  [one,  two]  order.  Let  /  equal 
the  number  of  X  nodes  labelled  with  one,  that  is  /  =  Let  g  =  \Fi  Since  =  pi 

and  rig  =  P2j  then  <  rig  and  hence  53  =  ^2  \  Fi  is  an  NPS  in  G  \  (Fi  U  T\Fi)),  Since  Fi 
is  comprised  of  MMBNPSs,  max/_QnJ  =  n^.  Thus,  Lemma  3.24  teUs  us  that  ^  for 
all  Xi  with  label  two.  Hence  if  53  ^  0,  the  borrowing  of  53  is  >  +  Pi  and  53  must  contain 

an  MMBNPS  with  borrowing  >  6  +  pi.  This  is  impossible,  since  such  a  MMBNPS  would 
have  been  included  in  i*i,  and  hence  53  ==  0.  Since  53  =  0,  F2  C  Fi  and  Hence 

Pi  =  p2.  Since  pi  =  P2,  we  can  now  use  a  symmetric  argument  to  show  that  Fi  C  F2.  I 


3.3.8  Nearly  Well-Ordered  Optimum  Schedules 

It  turns  out  that  there  are  also  non- well-ordered  schedules  for  a  graph  that  have  the  same 
fundamental  chart  as  all  the  well-ordered  schedules.  We  will  define  a  nearly  well-ordered 
schedule  as  any  schedule,  well-ordered  or  non- well-ordered,  that  has  the  same  fundamental 
chart  as  all  the  well-ordered  schedules. 

We  will  show  that  the  following  two  problems  are  equivalent  under  polynomial- time 
reductions. 

Find  an  arbitrary  optimum  schedule  (FIND-ARB-OPT) 

INSTANCE:  A  bipartite  constraint  graph. 

FIND:  An  arbitrary  optimum  schedule. 

Find  a  nearly  well-ordered  optimum  schedule  (FIND-NEAR- WELL-OPT) 
INSTANCE:  A  bipartite  constraint  graph. 

FIND:  A  nearly  well-ordered  optimum  schedule. 
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Since  the  set  of  nearly  well-ordered  optimum  schedules  are  a  subset  of  the  arbitrary 
optimum  schedules: 

Lemma  3.46  FIND-ARB-OPT  :<t  FIND-NEAR-WELL-OPT.  I 
To  see  the  other  direction,  let’s  first  examine  the  problem: 

Determine  the  fundamental  chart  (FUND-CHART) 

INSTANCE:  A  bipartite  constraint  graph. 

FIND:  The  fundamental  chart. 

Lemma  3.46  FUND-CHART  ^■p  FIND-ARB-OPT. 

Proof;  We  are  given  a  graph  and  we  want  to  determine  its  fundamental  chart. 

First  we  will  find  the  fundamental  chart  of  the  ascent.  Do  the  foUowing  with  k  running 
from  1  to  n  -b  1.  Add  a  new  component  Kk,n+i  to  the  original  graph,  with  the  smaller  side 
of  the  new  complete  graph  in  X.  Find  an  arbitrary  optimum  solution  to  the  new  graph; 
that  is,  call  a  polynomial  time  FIND-ARB-OPT  routine.  Plot  the  resulting  overhangs  as  a 
function  of  k]  call  this  plot  the  scan  of  the  graph.  Figure  3.8  shows  the  scan  for  the  ascent 
of  the  graph  in  Figure  3.6. 

It  turns  out  that  we  can  read  the  fundamental  chart  right  off  the  scan.  Look  at  the 
horizontal  sections.  For  example,  look  at  the  horizontal  section  from  k  =  2  to  k  =  6.  This 
means  that  when  adding  ifa.n+i)  A’4,„+i,  ffB.n-t-i  or  A^e.n+i.  there  must  be  a  NPS  made 
from  nodes  in  the  original  graph  that  preceded  the  Kk,n+i  components  in  the  schedule. 
This  NPS  must  have  had  a  borrowing  of  2  and  a  profit  of  4.  This  generalizes.  If  there  is  a 
horizontal  section  from  ki  to  fc2,  then  the  fundamental  chart  has  a  (6,p)  =  (-ki, *2  -  Aji) 
component.  Thus  the  fundamental  chart  for  the  example  in  Figure  3.8  is  [(-2,4)(-7, 1)]. 

This  carries  over  to  finding  the  fundamental  chart  for  the  transpose  of  the  descent.  For 
the  plateau,  there  will  be  no  horizontal  sections  in  the  scan.  This  can  be  fixed  by  adding  a 
new  y  node  and  connecting  it  to  all  the  original  X  nodes.  When  we  find  that  this  modified 
graph  is  of  type  (6, 1)  for  some  6,  then  we  know  that  the  original  plateau  is  of  type  (i>,  0). 

I 

Let  the  first  horizontal  section  be  from  ki  to  k2-  Let’s  look  at  the  schedule  that  was 
generated  when  we  added  the  Kk3.n+i  piece  and  called  the  FIND-ARB-OPT  routine.  Let’s 
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focus  on  the  part  of  the  schedule  just  up  to  when  the  n  +  1  Y  nodes  of  the  iiTfcz.n+i  are 
released.  Label  the  Kkj, n+i  piece  with  two  and  label  all  the  X  nodes  that  occur  before 
the  last  X  node  of  the  Kkj,n+i  piece  with  one.  Segregate  in  [one,  two]  order.  The  result 
is  a,  (—ki,k2  —  ki)  piece  followed  by  a  (— A:2>”'  +  1  ~  ^2)  piece.  How  do  we  know  that  the 
first  piece  is  a  {-kuk2  -  ki)  piece?  The  piece  can’t  have  borrowing  >  -ki,  because  then 
the  first  horizontal  section  of  the  scan  would  have  started  earlier.  The  piece  can’t  have 
borrowing  <  -Jbi,  because  the  (-A:2,n  +  1  -  *2)  piece  doesn’t  contribute  a  positive  amount 
to  the  chart  value  until  the  end,  and  the  overall  borrowing  is  only  fci.  If  the  piece  had  profit 
>  k2-  ki,  then  the  first  horizontal  section  of  the  scan  would  continue  past  fc2-  Lastly,  if 
the  profit  were  <k2-ki,  then  the  {-k2,n-\- 1  -  k2)  piece  would  cause  a  borrowing  < -k^. 

The  {-ki,k2  -  ki)  piece  gives  us  the  first  fragment  of  a  nearly  well-ordered  optimum 
schedule.  Then  we  can  simply  delete  all  those  nodes  from  the  graph  and  repeat.  Hence: 

Corollary  3.47  FIND-NEAR- WELL-OPT  :<t  FIND-ARB-OPT.  I 
3.3.9  Merge 

With  what  we  now  know  about  weU-ordered  schedules  we  can  return  to  the  question  that 
started  this  section.  Is  there  a  simple  method  of  combining  independent  optimum  schedules 
of  disconnected  subgraphs  into  an  optimum  schedule  for  the  whole  graph?  We  will  show 
that  well-ordered  schedules  are  easier  to  merge  than  arbitrary  optimum  schedules.  We  will 
show  this  by  considering  the  following  three  problems. 

MERGE- ARB 

INSTANCE:  Two  separate  constraint  graphs  and  arbitrary  optimal  schedules  for  them. 
FIND:  An  arbitrary  optimal  schedule  for  the  combined  graph. 


MERGE-NEAR 

INSTANCE:  Two  separate  constraint  graphs  and  nearly  well-ordered  optimal  schedules  for 
them. 

FIND:  A  nearly  well-ordered  optimal  schedule  for  the  combined  graph. 
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MERGE- WELL 

INSTANCE:  Two  separate  constraint  graphs  and  well-ordered  optimal  schedules  for  them. 
FIND:  A  weU-ordered  optimal  schedule  for  the  combined  graph. 

We  will  show  that  MERGE-WELL  and  MERGE-NEAR  are  quite  easy.  However, 
MERGE-ARB  is  as  hard  as  the  problem  of  finding  a  nearly  well-ordered  schedule  and 
hence  can  only  be  solved  easily  if  PLT  can. 

Lemma  3.48  FIND-NEAR- WELL-OPT  :<r  MERGE-ARB 

Proof:  The  proof  runs  exactly  like  Lemma  3.46  and  CoroUary  3.47,  except  that  we 
replace  calls  to  FIND-ARB-OPT  with  calls  to  MERGE-ARB.  Everything  else  goes  through 
the  same  way.  I 

Corollary  3.49  MERGE-ARB  =  PLT.  I 

What  about  MERGE- WELL?  Once  again,  the  problem  can  be  partitioned  into  ascent, 
plateau  and  descent  pieces.  The  ascent  of  the  merged  schedules  will  be  a  merge  of  the 
two  ascents.  The  plateau  of  the  merged  schedules  will  be  a  merge  of  the  two  plateaus.  To 
merge  the  two  descents  we  transpose  them  into  ascents,  merge  them  and  then  transpose 
them  back. 

How  do  we  merge  two  well-ordered  ascents?  Corollary  3.42  tells  us  that  greedy  use 
of  AMMBNPS  is  optimal.  Fortunately,  Lemma  3.39  tells  us  that  a  MMBNPS  must  be  a 
connected  piece  and  hence  any  MMBNPS  of  the  combined  graph  is  an  MMBNPS  of  one 
of  the  ascents.  Look  at  the  first  MMBNPS  of  each  of  the  two  schedules.  They  are  easy 
to  find  since  they  are  already  right  at  the  beginning  of  the  schedules.  They  are  of  some 
types,  say  (&i,pi)  and  {b2,P2)-  If  hi  >  ^2  then  by  greedy  use  of  AMMBNPS  we  can  choose 
the  (hi.pi)  piece  as  the  first  MMBNPS  of  our  new  merged  schedule;  otherwise  choose  the 
(62,  pj)  piece  as  the  first  MMBNPS.  After  removing  the  chosen  MMBNPS,  iterate  on  the 
remaining  portion  of  the  graph. 

The  optimal  merge  of  two  weU-ordered  plateaus  is  even  simpler.  One  only  needs  to 
concatenate  the  two  schedules  without  interweaving.  This  is  because  all  MMBNPSs  have 
profit  =  0  and  each  (6,0)  MMBNPS  will  dip  to  the  same  chart  level  relative  to  the  peak 
balance  regardless  of  the  order  of  the  MMBNPSs. 
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For  example,  given  the  following  two  well-ordered  schedules. 

[(-2, 0),  (-1,2),  (-2,0),  (-2,0),  (-5,  -l),(-5,  -3)] 
[(-l,l),(-3,l),(-l,0),(-3,0),(-4,-l),(-4,-3)]. 

The  (6,p)  pairs  up  to  and  including  the  last  one  with  a  positive  p  comprise  the  ASCENT. 
The  (6,p)  pairs  starting  from  and  including  the  first  one  with  a  negative  p  comprise  the 
DESCENT.  The  (6,0)  pairs  left  in  the  middle  comprise  the  PLATEAU.  So,  one  possible 
well-ordered  merge  is: 

[(-l,l),(-2,0),(-l,2),(-3,l),(-2,0),(-2,0),(-l,0),(-3,0), 

(_5,-1),(-4,-1),(-5,-3),(-4,-3)] 

What  about  MERGE-NEAR?  It  turns  out  that  we  can  use  the  same  algorithm  used  for 
MERGE- WELL. 

3.4  Weighted  Chains 

We  can  generalize  “charts”  to  handle  weighted  nodes  directly.  The  reduction  in  Section  2.3 
was  mainly  to  show  equivalence.  It  is  unlikely  that  someone  faced  with  the  weighted  problem 
would  wish  to  convert  it  to  the  unit- weight  version  before  actually  solving  it. 

We  generalize  the  definitions  of  rij  and  in  Section  3.2  to: 


ni  =  S 

y€Ti  xeSi 

fii  = 

V€Tj  *e5i+i 

So  the  graph  in  Figure  2.15  with  the  schedule  Sx  =  [a:i>  ®2,  ®4,  *3]  will  have  the  chart 

shown  in  Figure  3.9. 

The  algorithm  for  merging  schedules  based  on  their  charts  remains  the  same,  so  we  can 
merge  weU-ordered  schedules  of  weighted  graphs  in  polynomial  time.  This  serves  as  the 
basis  for  a  polynomial  time  algorithm  to  schedule  weighted  chains. 

A  chain  is  a  string  of  nodes  as  shown  in  Figure  3.10. 

The  algorithm  to  schedule  chains  is  based  on  dynamic  programming.  We  will  compute 
the  optimum  schedule  for  every  connected  subgraph  of  the  chain.  These  correspond  to 
intervals  of  the  chain.  There  are  0((Tn  -|-  n)^)  such  intervals. 
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Xi  X5  X2  X4  X3 


Figure  3.9:  Chart  for  a  weighted  graph. 


Figure  3.10:  A  chain. 

We  compute  these  schedules  in  a  particular  order  and  by  a  particular  method:  We  first 
compute  the  schedule  for  all  connected  subgraphs  of  size  1,  then  size  2,  then  size  3,  and 
so  on.  For  a  particular  graph  with  k  X  nodes,  we  compute  its  schedule  as  follows:  We 
will  compute  k  different  schedules  one  of  which  is  guaranteed  to  be  a  nearly  well-ordered 
optimum;  then,  we  pick  out  the  nearly  weU-ordered  optimum  from  this  set. 

The  k  different  schedules  correspond  to  the  fact  that  the  schedule  must  start  with  some 
one  of  the  X  nodes.  We  can  easily  compute  the  best  schedule  given  that  a  particular 
node  X  is  first:  After  removing  i,  what  is  left  is  one  or  two  smaller  connected  pieces.  We 
have  already  computed  nearly  well-ordered  optimums  for  these  pieces.  By  merging  them, 
we  get  a  nearly  well-ordered  optimum  for  the  whole  graph  piece  minus  the  node  x. 

By  the  suffix  law  of  Corollary  3.51,  if  some  nearly  well-optimum  for  the  piece  begins 
with  X,  then  the  suffix  of  the  optimum  schedule  can  be  any  well-ordered  optimum  schedule 
of  the  remaining  graph.  Thus  by  concatenating  the  merge  of  the  pieces  to  x,  we  will  get 
a  well-ordered  optimum  for  the  piece,  assuming  there  is  a  well-ordered  optimum  beginning 
with  X.  That  is: 

Lemma  3.60  Given  that  there  is  a  well-ordered  optimum  schedule  starting  with  the  node  i. 
Let  5  =  {Sx,Sy)  be  an  arbitrary  well-ordered  optimum  schedule  for  G\x.  Then  5  = 
([x,  5x],  -Sy )  is  a  well-ordered  optimum  schedtile  for  G.  I 

Corollary  3.51  Given  that  there  is  a  nearly  well-ordered  optimum  schedule  starting  with 
the  node  x.  Let  5  =  {Sx,Sy)  be  an  arbitrary  nearly  well-ordered  optimum  schedule  for 
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G\x.  Then  5  =  ([a:,  5jr],  Sy)  is  a  nearly  well-ordered  optimum  schedule  for  G.  I 

Since  we  construct  one  such  schedule  for  each  x,  and  some  x  must  be  first  in  the  nearly 
well-ordered  optimum,  at  least  one  of  the  k  schedules  is  our  nearly  well-ordered  optimum 
schedule.  AH  we  need  to  do  is  pick  it  out  and  we  are  done. 

So  this  brings  up  the  new  sub-problem: 

Select  a  nearly  well-ordered  optimum  (SELECT-NWO-OPT) 

INSTANCE:  A  set  of  schedules  that  is  guaranteed  to  contain  a  nearly  well-ordered  optimum 
schedule. 

FIND:  A  member  of  the  set  that  is  a  nearly  well-ordered  optimum  schedule.. 

Since  we  just  have  to  pick  an  optimum  out  from  a  set  of  schedules  which  is  guaranteed 
to  contain  one,  we  only  need  to  be  able  to  make  comparative  evaluations:  “this  schedule  is 
more  nearly  well-ordered  than  this  other  schedule.” 

So  how  do  we  pick  out  a  nearly  well-ordered  optimum  schedule  from  amidst  a  line-up 
of  fakers?  For  example,  say  that  our  set  of  schedules  is: 

51  =  [(-2,2),(-3,2),(-2,0),(-l,0),(-3,-2)] 

52  =  [(-2,2),(-3,2),(-3,0),(-l,0),(-3,-2)] 

53  =  [(-3,2),(-2,2),(-2,0),(-l,0),(-3,-2)] 

54  =  [(-2,2),(-3,l),(-2,0),(-l,0),(-3,-l)] 

55  =  [(-2,2),(-3,2),(-l,0),(-2,0),(-3,-2)] 

Divide  the  schedules  into  ascent,  plateau  and  descent.  We  can  discard  a  schedule  if 
its  plateau  dips  further  negative  relative  to  the  peak  balance  than  some  other  schedule.  So 
in  the  example,  we  can  discard  52. 

Now  we  just  have  to  check  the  ascents  (and  the  transposed  descents).  Assume  that 
we  have  two  alternate  schedules  5  and  S'.  Assuming  they  are  nearly  well-ordered  we 
can  easily  determine  their  fundamental  charts  and  break  them  up  into  their  fragments; 
5  =  [Fi,  Fz,  Fa, . . .]  and  S'  =  [F(,  F^,  F^, . . .].  Corresponding  to  each  fragment  is  a  (f»i,pi) 
or  pair.  In  our  example,  we  now  have: 


5i  =  {(-2, 4),  (-2,0),  (-3, -2)] 
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53  =  [(-3, 4),  (-2,0),  (-3, -2)] 

54  =  [(-2, 3),  (-2,0),  (-3,-1)] 

55  =  [(-2,4),  (-2,0),  (-3, -2)1 

For  t  increasing  from  1  perform  the  following  comparison  procedure:  If  hi  >  then  we 
can  discard  S'  and  stop.  If  h-  >  bi  then  we  can  discard  5  and  stop.  So  =  6-.  If  pi  >  p- 
then  we  can  discard  S'  and  stop.  If  p-  >  Pi  then  we  can  discard  5  and  stop. 

Lemma  8.62  If  both  schedules  survive  the  comparison  procedure,  and  either  one  is  a 
nearly  well-ordered  schedule,  then  the  other  is  too. 

Proof:  Easy.  Both  survive  if  and  only  if  they  have  the  same  fundamental  chart.  | 

In  our  example,  only  schedules  Si  and  survive. 

Since  all  nearly  well-ordered  schedules  will  survive,  since  there  exists  at  least  one  well- 
ordered  schedule,  and  since  the  surviving  schedules  are  either  all  nearly  well-ordered  or  all 
not  nearly  well-ordered,  then  all  the  surviving  schedules  are  nearly  well-ordered.  Thus  we 
can  arbitrarily  pick  any  one  of  them. 


Chapter  4 


One,  Two,  Three,  Approximation 


4.1  Nodes  with  degree  =  1 

There  is  a  simple  rule  for  any  Y  node  with  degree  =  1,  that  is,  for  any  node  y  with 
r(y)  =  {a:}.  We  can  schedule  the  node  x  immediately.  We  can  do  this  because  scheduling 
X  has  a  borrowing  of  1  and  a  non-negative  profit;  it  is  (—1,0)  and  hence  it  is  an  MMBNPS 
and  we  can  schedule  it  next  by  Corollary  3.42.  If  this  creates  more  Y  nodes  of  degree  one, 
then  we  can  repeatedly  apply  this  rule.  By  transposition,  we  can  apply  a  similar  rule  when 
we  have  an  X  node  of  degree  1. 

4.1.1  Scheduling  trees  in  polynomial  time 

A  consequence  of  this  is  that  we  can  handle  graphs  whose  underlying  constraint  graph, 
ignoring  the  directions  on  the  precedence  arcs,  is  a  tree.  We  can  schedule  a  tree  as  fol¬ 
lows;  As  long  as  there  is  a  leaf  node  that  is  a  T  node,  schedule  its  A-neighbor  next.  This 
may  completely  schedule  the  tree,  or  it  may  leave  us  a  tree  in  which  all  the  leaves  are  X 
nodes.  In  the  latter  case,  we  will  have  a  partial  schedule  {Sx,Sy)-  So  for  the  example 
in  Figure  4.1,  we  may  get  a  schedule  ([xi,X2,X6][0,yi,y2>y3,y6])-  If  our  remaining  tree 
has  only  X  nodes  as  leaves,  then  we  treat  the  remainder  as  a  separate  problem  by  trans¬ 
posing  it  and  recursing.  This  will  return  us  a  schedule  (Sy^Sx)  for  the  transposed  tree. 
So  for  the  example  in  Figure  4.1,  we  may  get  ([y6)y4].[0,®5,*4i®3])-  Easily  construct  a 
schedule  for  the  overall  graph  as  ([5x,rev(5^)],  [5y,rev(Sy)]),  so  in  our  example,  we  get: 
([xi,  l2,  ®6,  ®3)  ®4>  ®5,  0],  [0>  yi>  y2,  3/3,  3/6,  3/4,  l/s])- 
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Figure  4.1:  Scheduling  a  tree  in  polynomial  time 


4.2  Nodes  with  degree  =  2 

The  degree  =  1  rule  that  we  just  used  suggests  a  degree  =  2  rule.  Say  that  we  have  a  node  y 
with  degree  2  and  r(j/)  =  Once  we  schedule  one  of  these  two  nodes  —  without 

loss  of  generality  say  ®i  —  then  y  becomes  a  degree  1  node.  Once  Xi  has  been  scheduled 
and  removed  from  the  graph,  scheduling  12  n®xt  becomes  a  (—1,0)  MMBNPS  and  hence  is 
optimal  by  Corollary  3.42. 

So  the  general  rule  is: 


Lemma  4.1  Given  a  schedule  Sx  =  [®i>-  ••)®n]  and  a  node  y  such  that  r(y)  = 
t  <  j,  transforming  the  schedule  into  Sx  —  [®i)  •  •  •  • -^m]  is 

delay  non-increasing.  | 


4.2.1  Collapsing  Transformation 

The  lemma  leads  us  to  consider  a  transformation  where  we  collapse  together  the  two  X  nodes 
and  discard  the  Y  node  as  shown  in  Figure  4.2.  Define  the  Collapsing  Transformation  as: 

•  Replace  a  degree  2  node  y  and  its  two  neighbors  ®i  and  ®2  with  a  single  node  x'  such 

that  r(®')  =  (r(®i)  u  r(®2))  \  y- 

Under  the  right  conditions,  this  transformation  will  not  affect  the  overhang  of  the  graph. 
Under  the  wrong  conditions,  such  as  in  the  two  graphs  in  Figure  4.3,  the  Collapsing  Trans¬ 
formation  does  affect  the  overhang. 
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Figure  4.2:  Collapsing  Transformation  applied  to  y 


yi 

y2 


Xi 


• - o  yi 


yi 


Xi  • 


Figure  4.3:  Bad  Collapsing  Transformations 


4.2.  NODES  WITH  DEGREE  =  2 


77 


X1  *2  *3  X  X3 

Figure  4.4:  Effect  of  constrained  Collapsing  Transformations 

The  following  lemma  spells  out  the  proper  constraints  to  ensure  that  our  transformation 
doesn’t  change  the  overhang  of  the  graph. 

Lemma  4.2  Under  the  constraints  that  r(ii)  D  r(x2)  =  {y}  aJid  |r(xi)  U  r(i2)|  >  2,  the 
Collapsing  Transformation  leaves  the  overhang  of  the  graph  unchanged. 

Proof:  By  Lemma  4.1  there  is  an  optimum  schedule  of  the  original  graph  that  has  xi 
and  X2  consecutive.  Create  an  ordering  for  the  new  post-collapsing  graph  that  schedules  x 
in  the  place  of  where  xi  and  X2  were.  We  claim  that  this  ordering  has  the  same  delay  as  the 
optimal  schedule  of  the  original  graph.  To  see  this,  we  just  need  to  look  at  the  two  cases 
shown  in  Figure  4.4: 

•  Case  1:  Due  to  their  position  in  the  schedule,  either  xi  or  X2  releases  at  least  one  job 
other  than  y,  say  y2.  Since  r(xi)  H  r(x2)  =  {y}>  Vi  can  be  released  by  executing  the 
appropriate  one  of  the  two  x  jobs  first.  Without  loss  of  generality  assume  that  it  is 
xi.  In  this  case  the  original  schedule  could  have  scheduled  xi  before  X2  and  thus  the 
minimum  borrowing  of  the  chart  is  clearly  not  affected  by  the  transformation. 

•  Case  2:  Due  to  their  position  in  the  schedule,  neither  ii  nor  12  releases  a  job  other 
than  y.  Since  |r(xi)ur(x2)|  >  2,  X2  and  Xi  cannot  be  the  last  X  jobs  scheduled;  they 
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yz 

yi 

ya 

y4 

must  be  followed  by  some  other  job,  say  13.  Again,  the  chart  is  clearly  not  affected 
by  the  transformation. 

This  shows  that  the  collapsed  version  has  a  delay  no  better  than  the  original.  To  see 
that  it  is  no  worse  note  that  we  could  just  replace  a:  by  and  ®2  and  yield  a  schedule  for 
the  original  graph  that  is  no  worse  than  the  one  that  we  already  know  is  optimal.  | 

So,  naturally,  repeating  this  constrained  Collapsing  Transformation  as  many  times  as 
we  want  will  not  change  the  overhang  of  the  graph. 
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Figure  4.5:  Expanding  nodes  into  degree  3  nodes 


4.3  Nodes  with  degree  <  3 

There  is  no  apparent  way  to  extend  this  to  collapsing  degree  3  nodes.  Instead  what  we 
will  do  in  this  section  is  look  at  a  reverse  of  the  Collapse  Transformation  from  the  previous 
section,  and  we  will  simultaneously  explore  the  question:  how  hard  is  it  to  optimally  schedule 
graphs  whose  nodes  all  have  degree  <  3? 

4.3.1  Expanding  Transformation 

What  does  the  reverse  of  the  CoUapse  Transformation  look  like?  An  example  is  shown  in 
Figure  4.5.  Say  in  a  graph  G  we  have  a  node  ®i  of  degree  4  with  r(xi)  =  {yi,  j/2>  ^3*  To 
yield  the  graph  G*,  we  remove  the  node  ®i  and  replace  it  with  the  two  nodes  x[  and  ,  and 
we  create  a  new  node  y[.  We  add  edges  so  that  r(xi)  =  {yijy2yyi}  and  r(xi)  =  {y3)!/4)yi}- 
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Since  r(yi)  =  {li.ii'}  and  r(ii)  n  r(a:i')  =  {y[}  and  |r(a:i)  U  r(ii')l  >  2,  then  we  could 
clearly  apply  the  CoUapsing  Transformation  to  y'  and  G'  yielding  the  original  graph  G .  Thus 
the  two  graphs  G  and  G'  have  the  same  overhang  by  Lemma  4.2.  The  only  constraints  for  the 
Expanding  Transformation  are  that  |r(ii)|  >  1  and  that  we  ensure  that  r(a:^)nr(a:j )  =  {y^} 
and  r(xi)  U  r(ii')  =  r(i)  U  {y'}. 

Lemma  4.S  Under  the  constreunts  that  r(ii)  0  T{x")  =  {y'}  and  r(ii)  U  r(ii)  =  r(®)  U 
{y'},  the  Expanding  Transformation  leaves  the  overhang  of  the  graph  unchanged. 

Proof:  By  Lemma  4.2,  if  we  start  with  G'  and  collapse  y',  yielding  G,  we  will  not  have 
changed  the  overhang  of  the  graph.  Thus  G  and  G'  have  the  same  overhang.  | 

4.3.2  Three  ones  per  row 

We  have  seen  that  it  is  easy  to  solve  the  PLT  problem  when  we  have  two  Is  per  row.  What 
about  threes  Is  per  row?  four  Is  per  row?  In  this  section  we  will  look  at  the  special  case 
of  having  at  most  three  Is  per  row.  We  will  show  that  this  special  case  is  as  hard  as  the 
original  problem.  Even  the  restriction  “at  most  three  Is  per  row  and  at  most  three  Is  per 
column”  is  as  hard  as  the  original  problem. 

Lemma  4.4  Any  n  x  n  matrix  M  with  <  A:  Is  per  column,  can  be  reduced  to  an  equivalent 
2n  X  2n  matrix  M'  with  <  (|■A:/2]  +  1)  Is  per  column.  By  being  “equivalent”  we  mean  that 
they  have  the  same  overhang. 

Proof:  Choose  any  X  node  (that  is,  a  column)  with  degree  >  \k/2]  +  1.  Apply  the 
Expanding  Transformation  to  it,  creating  new  nodes  x'  and  x"  such  that  |r(x')|  <  \k/2]  + 1 
and  |r(x")|  <  [A:/2]  +  1.  We  can  always  do  this,  since  |r(x')|  +  |r(x")|  =  |r(x)|  + 1  <  fc  +  1. 

Each  transformation  adds  one  row  and  one  column  to  the  matrix  and  preserves  the 
overhang.  There  need  be  at  most  n  transformations.  | 

Corollary  4.6  Any  n,x  n  matrix  with  <  n  Is  per  column,  can  be  reduced  to  an  equivalent 
n*  X  n®  matrix  with  <  3  Is  per  column.  | 

Corollary  4.6  Any  n  x  n  matrix  with  <  n  Is  per  row  and  <  n  Is  per  column  can  be 
reduced  to  an  equivalent  x  matrix  with  <  3  Is  per  row  and  <  3  Is  per  column.  I 

Figure  4.6  shows  an  example  where  we  go  from  4  Is  per  row  to  3  Is  per  row. 
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Figure  4.6:  Prom  4x  to  3x  per  row  and  column 

4.4  Polynomial  Solutions  to  Overhang- Limited  PLATEAUs 

In  this  section  we  describe  an  algorithm  that  can  solve  PLATEAU-SCC  with  a  fixed  bound 
of  t  =  A:  and  consequently  AMMBNPS  with  borrowing  <  k  in  time  The  work 

was  inspired  by  work  on  the  related  problem  known  as  Bandwidth  Minimization  [GGJK78] 
[Sax80]  [GS84].  One  interesting  consequence  of  this  result  is  that  we  can  solve  the  PLT3 
problem  in  polynomial  time  for  any  graph  whose  nodes  are  all  of  degree  exactly  equal  to 
three. 

Since  we  will  deal  with  PLATEAU-SCC,  the  chart  will  never  go  above  0.  Since  we  are 
only  interested  in  schedules  with  a  bound  oit  =  k,  the  chart  can  never  go  below  k.  It  is 
this  tight  constraint  on  chart  values  that  allows  us  to  have  a  polynomial  time  algorithm. 

Note  that  we  are  only  interested  in  finding  some  schedule  with  delay  <  k,  not  necessarily 
an  optimum  one.  An  optimum  one  can  be  found  by  running  this  algorithm  multiple  times 
with  different  values  for  k. 

Here  is  how  it  works: 

At  phase  i,  the  set  of  active  schedules  can  be  depicted  as  a  rooted  tree  of  depth  t.  The 
edges  of  the  search  tree  will  have  the  names  of  X  nodes  attached  to  them.  Every  active 
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Figure  4.7:  The  Input  Graph 
0 


-1  -1  -1  -1 
Figure  4.8:  Phase  1 


leaf  of  the  graph  represents  an  active  alternative  sub-schedule  for  the  graph.  That  sub¬ 
schedule  is  simply  the  sequence  of  edge  labels  along  the  path  from  the  root  of  the  tree  to 
the  leaf.  The  nodes  will  have  numbers,  representing  chart  levels,  attached  to  them. 

For  example,  let  the  graph  in  Figure  4.7  be  our  input.  The  search  tree  is  expanded 
breadth  first.  So  the  search  tree  after  phase  1  is  shown  in  Figure  4.8.  This  means  that  our 
four  active  sub-schedules  are  [A],  [jB],  \C]  and  [£>].  Each  has  a  resulting  balance  of  -1. 

At  the  next  phase,  each  active  leaf  is  expanded  in  all  possible  ways  to  yield  sub-schedules 
that  are  one  node  longer.  Thus  at  phase  2,  we  have  the  tree  in  Figure  4.9.  Fortunately,  at 
this  point  we  can  prune  the  tree.  Without  any  pruning,  the  algorithm  would  require  0(n!) 
time. 

Here  is  how  the  pruning  works:  Since  the  leaf  corresponding  to  5  =  \A,B\  rises  back 
up  to  a  chart  level  that  was  achieved  by  a  prefix  of  it,  namely  S'  =  [A],  then  5  \  5Ms  a 
nonnegative  profit  sub-schedule  of  G  \  S'.  Indeed,  it  is  a  shortest  nonnegative  profit  sub¬ 
schedule  of  G  \  5'  and  hence  a  minimal  NPS.  Let  b  be  the  lowest  chart  level  reached  by  5 
and  b'  be  the  final  chart  level  reached  by  S'.  Lemma  3.40  tells  us  that  there  is  some  schedule 
of  G  \  5'  that  starts  with  S\S'  and  has  borrowing  of  min(6  -  b',  -overhang(G  \  5')).  Thus, 
except  for  the  path  from  S'  to  S  in  the  search  tree,  we  can  prune  all  other  descendants  of 
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-1  -2  -2  -1  -1  -2  -2  -1  -2  -2  -2  -2 

Figure  4.9:  Phase  2 
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-1  -2  -1  -2  -1  -2  “2  -2  -2  -1  -2  -1 
Figure  4.10:  Phase  3 

the  node  corresponding  to  S'.  This  is  why  the  nodes  corresponding  to  [^4,  C}  and  [A,  D\  are 
filled  in  in  black.  Note  that  Lemma  3.40  tells  us  that  the  choice  between  saving  [B,  A]  and 
\B,  C]  can  be  made  arbitrarily. 

The  next  phase  is  shown  in  Figure  4.10.  We  have  again  expanded  aU  active  leaves.  This 
time  we  have  a  larger  example  of  pruning.  S  =  \D,B,C]  climbs  back  up  to  the  chart  level 
achieved  by  S'  =  [i?].  This  allows  us  to  prune  5  leaves  and  2  interior  nodes. 

The  algorithm  continues  in  this  manner  until  completion  (only  one  more  phase  for 
our  example).  Because  of  the  pruning  described  above,  if  a  leaf  node  is  labelled  with 
borrowing  h,  then  only  |6|  of  its  ancestors  can  currently  have  multiple  children.  Let’s  add 
one  more  pruning  rule:  If  a  leaf  is  labeled  with  —k,  then  mark  that  leaf  as  inactive.  We 
can  do  this  because  extending  the  sub-schedule  any  further  will  violate  the  given  bound  on 
borrowing. 

The  result  of  all  this  is  that  there  are  never  more  than  active  leaves  at  the  end  of 
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any  phase.  This  implies  that  no  more  than  nodes  are  ever  generated.  Allowing  0{v}) 
time  to  generate  each  node,  our  overall  time  is 

4.4.1  3-Regular  Graphs 

As  we  mentioned  at  the  beginning  of  the  section,  a  consequence  of  this  result  is  that  if 
the  nodes  of  a  graph  are  oil  of  degree  excictly  equal  to  three,  then  we  can  solve  the  PLT3 
problem  in  polynomial  time. 

This  is  because  of  the  following  lemma: 


Lemma  4.T  Each  connected  component  of  a  A:-regular  graph  is  a  single  PLATEAU-SCC 
problem. 

Proof:  There  is  a  perfect  matching  of  the  graph,  and  hence  every  component  is  part  of 
the  plateau,  but  we  have  made  an  even  stronger  claim:  that  every  connected  component  is 
a  single  PLATEAU-SCC  piece.  To  see  this,  assume  without  loss  of  generality  that  we  have 
two  PLATEAU-SCCs  (?i  =  {Xi.ii}  and  G2  =  {^’2,72}  that  are  connected  by  an  edge  ci 
from  Xi  to  Y2.  Since  Gi  and  G2  are  PLATEAU-SCC  pieces,  |Xi|  =  lYil  and  IX2I  =  ITjI- 
There  are  A:  •  |7i|  edges  from  nodes  in  Yi.  Because  of  the  edge  from  Xi  to  Y2,  there  can 
be  at  most  jfc  •  |Xi|  -  1  =  Jfe  •  |lil  -  1  edges  between  Yi  and  Xi.  Thus  there  must  be  an 
edge  C2  from  Yi  to  X2.  Since  neither  ei  nor  62  is  a  matching  edge,  Gi  and  G2  are  strongly 
connected  together.  | 

At  first  it  might  seem  like  we  have  just  solved  our  overall  problem.  We  haven’t.  The 
deficiency  isn’t  that  our  algorithm  only  works  for  3-regular  graphs  —  we  only  used  the 
3-regularity  of  the  graphs  to  show  that  we  had  a  PLATEAU-SCC  problem.  The  algorithm 
will  solve  any  PLATEAU  problem  with  <  =  3.  We  can  also  use  it  to  find  MMBNPS  with 
borrowing  =  3. 

The  shortcoming  is  this:  while  being  able  to  solve  PLATEAUs  and  MMBNPS  implies 
that  we  can  solve  PLT,  it  is  not  true  that  solving  PLATEAU  with  <  =  3  and  MMBNPS 
with  borrowing  =  3  implies  that  we  can  solve  PLT3.  Indeed,  solving  a  PLT3  problem  may 
involve  solving  PLATEAU  problems  with  arbitrarily  large  t. 
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4.5  Approximations 

We  have  seen  that  given  a  PLATEAU-SCC  piece,  we  can  find  an  optimal  schedule  in 
polynomial  time,  provided  that  the  delay  of  the  optimal  schedule  is  below  some  fixed  k  of 
our  choosing.  If  the  actual  delay  is  large,  this  is  not  useful.  In  this  case,  we  would  stiU  Uke 
to  be  able  to  get  an  approximation  to  the  delay,  and  to  get  a  schedule  that  is  approximately 

optimum. 

Consider  the  simplest  approximation  algorithm:  While  lyj  >  0,  choose  the  node  y  E  Y 
of  minimum  degree  and  schedule  the  nodes  r(y).  If  there  is  a  tie  for  minimum  degree,  pick 

randomly. 

What  can  we  say  about  this  approximation  algorithm?  Let’s  concentrate  on  graphs  with 
degree  <  3.  Also  let’s  assume  we  have  a  single  PLATEAU  piece  with  m  =  mx  =  ’Tiy. 

The  first  thing  to  note  is  that:  after  our  algorithm  picks  the  first  Y  node  and  schedules 
r(y),  the  graph  will  have  [Jfl  <  |y|.  This  will  hold  true  until  all  the  Y  nodes  are  exhausted. 

What  does  this  tell  us? 

Lemma  4.8  If  we  have  a  graph  with  X  nodes  of  degree  <  c,  then  some  Y  node  must  have 
degree  <  c  •  |X|/|y|. 

Proof:  This  is  a  simple  pigeon  hole  argument.  There  are  <  c  •  IX]  edges.  If  every  Y 
node  had  degree  >  c  •  |Xl/|y|,  then  we  would  have  >  c  •  |X|  edges.  Contradiction.  |. 

Corollary  4.9  If  we  have  a  graph  with  X  nodes  of  degree  <  3,  and  |X|  <  jyj  then  some 
y  node  must  have  degree  <2.  | 

This  gives  us  our  first  bound  on  the  overall  length  of  the  schedule  that  the  approximation 
algorithm  will  give  us. 

Lemma  4.10  The  approximation  algorithm  will  always  return  a  schedule  with  a  length  of 
at  most  1.5m  +  2  for  a  graph  with  max  Y  degree  =  3. 

Proof:  The  way  to  determine  the  length  is  to  find  a  limit  on  the  number  of  idle  y  jobs 
that  can  be  needed.  When  we  choose  a  Y  job  and  schedule  r(y),  we  need  |r(y)l  -  1  idle  Y 
jobs.  We  also  need  one  additional  idle  Y  job  at  time  1. 
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In  the  worst  case  the  first  Y  node  has  \T{y)\  =  3.  Thereafter,  CoroUary  4.9  teUs  us  that 
\T{y)\  <  2,  since  \X\  <  |r|.  We  can  have  |r(y)l  =  2  for  at  most  (m  -  3)/2  y-jobs.  So  we 
have  a  total  of  at  most  (m  -  3)/2  +  3  idle  jobs.  Thus  our  overall  length  is  <  1.5m  +  2.  | 

But  we  can  do  even  better  than  that.  This  is  because  if  the  schedule  is  really  using  one 
idle  Y  job  per  Y  job  released,  then  we  will  get  to  a  point  where  \Y\  >  1.5  •  IXj.  If  that 
happens,  then  Lemma  4.8  tells  us  that  there  is  a  T  node  of  degree  =  1.  Thus  it  would  seem 
that  we  wouldn’t  need  any  more  idle  y-slots  after  that  time.  There  is  one  complication. 
The  X  node  may  release  more  than  one  Y  node.  What  happens  if  the  balance  of  s  and 
y’s  shifts  back  so  that  \Y\  =  1.5  •  |X|  or  \Y]  <  1.5  •  |X|?  By  careful  counting  this  can  be 
shown  to  not  be  a  problem. 

Lemma  4.11  Given  a  graph  with  degree  <  3  for  the  X  nodes,  and  with  my  >  1-5  • 
where  mjr  =  |^|  and  my  =  \Y\.  The  approximation  algorithm  will  schedule  this  with  only 

one  idle  Y  job. 

Proof:  The  idle  Y  job  will  obviously  be  at  time  1.  What  we  need  to  show  is  that  if  we 
run  into  a  situation  where  all  the  Y  nodes  have  degree  >  2,  then  we  will  have  accumulated 
enough  excess  released  Y  jobs  that  we  can  execute  them  instead  of  having  an  idle  Y  job. 
To  do  this  we  will  show  that  the  invariant 

|y|  +  excess  —  1  >  IX]  +  mxl2 

always  holds  as  1X|  and  |y|  vary  and  that  excess,  the  number  of  excess  available  y  jobs,  is 
always  >  0. 

At  the  start  of  our  execution,  |X|  =  mx  and  |y|  =  my.  Since  my  >  1.5  •  mx,  then 
|y|  —  1  >  1.5  •  |JC|.  Since  excess  =  0,  the  invariant  holds  at  the  beginning  of  the  execution. 
How  does  it  change  as  we  go  along? 

CASE  1:  We  can  schedule  one  X  job  which  releases  k  >1Y  jobs.  Since  jy]  decreases  by 
k,  excess  increases  by  fc  —  1  and  [Jf  |  decreases  by  1,  the  invariant  remains  satisfied. 

CASE  2:  We  need  to  schedule  two  X  jobs  in  order  to  release  fc  >  1  y  jobs.  First  excess 
decreases  by  1,  then  |y|  decreases  by  k,  excess  increases  by  A:  -  1  and  jX"!  decreases 
by  2.  At  the  end  the  invariant  is  satisfied,  but  we  need  to  verify  that  excess  >  1  at 
the  beginning.  Since  we  needed  two  X  jobs,  we  know  that  initially  |y|  <  1.5  •  jXj. 
Since  |y|  +  crcess  -  1  >  \X\  +  mx/2  >  1.5- |X1,  then  cicess  >  1.5  •  jXj  -  |y|  +  1  >  1. 
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CASE  3:  We  need  to  schedule  three  X  jobs  in  order  to  release  k  >  lY  jobs.  First  excess 
decreases  by  2,  then  |y|  decreases  by  k,  excess  increases  by  fc  -  1  and  |X|  decreases 
by  3.  At  the  end  the  invariant  is  satisfied,  but  we  need  to  verify  that  excess  >  2  at 
the  beginning.  Since  we  needed  three  X  jobs,  we  know  that  initially  |y|  <  |X|.  Since 
|y|  +  excess  -1>\X\  +  mx/2,  then  excess  >  mx/2  +  |X|  -  |y|  +  1  >  mx/2  +  1.  For 
case  3  to  be  possible,  we  must  have  mx  >  3,  so  everything  is  OK. 


I 


So  what  is  the  largest  number  of  idle  jobs  we  can  need  before  we  hit  the  first  crossover 
point  where  jy]  >  1.5  •  1X|? 

For  the  portion  of  the  schedule  before  we  hit  |y|  >  1.5  •  jXl,  let  ji  be  the  number  of 
y  jobs  that  had  |r(y)|  =  i  when  they  were  picked  by  the  algorithm.  We  wiU  have  used 
2j3  +  J2  +  1  idle  jobs.  When  we  hit  the  crossover  point,  |y|  =  my  -  js  -  32  -  ji  and 
|X|  =  mx  -  3i3  -  2j2  -  Ji •  At  crossover  [yj  <  1.5  •  IX]  + 1,  so  my  -  js  -  ji  -  Ji  <  l-5mjf  - 
4.5j3  -  3j2  -  l-5ji  +  1-  Simplifying,  and  plugging  in  ja  =  1,  gives  2^2  +  ii/2  <  {mx  -  5)/2. 
We  want  to  choose  y'a  and  ji  to  maximize  j2  +  3,  which  means  ji  —  0  and  j2  =  {mx  —  5)/4. 
Hence  the  overall  idle  time  is  {mx  +  7)/4. 

Thus  we  can  replace  Lemma  4.10  with 

Lemma  4.12  The  approximation  algorithm  will  always  return  a  schedule  with  a  length  of 
at  most  (5m  +  7)/4  +  1  for  a  graph  with  max  Y  degree  =  3.  I 

Figure  4.11  shows  a  near  worst  case  example  where  this  bound  is  almost  achieved  because 
of  extreme  unluckiness  by  the  algorithm  when  it  chooses  between  nodes  of  equal  degree. 
The  6/8m  unshown  edges  from  A  go  to  the  nodes  in  B'  and  C  j  each  node  in  B  and  C 
touches  exactly  one  of  these  edges.  The  3/8m  unshown  edges  from  B  go  to  the  nodes  in  C"; 
each  node  in  C  touches  exactly  one  of  these  edges.  We’ll  specify  more  detail  in  a  moment. 
This  detail  is  enough  to  see  that  the  schedule  length  is  5m/4  +  1  without  the  dashed  edge 
and  (5m  +  4)/4  +  1  with  the  dashed  edge. 

How  does  this  compare  to  the  optimum  for  this  graph?  Not  very  well.  By  filling  in  the 
unspecified  edges  in  the  appropriate  pattern,  we  can  repeatedly  alternate  scheduling  A,  B 
and  C  nodes  in  the  pattern  shown  by  Figure  4.12.  There  will  be  some  initial  start-up  delay. 
But  once  we  get  over  that  constant  amount  of  borrowing,  we  won’t  need  any  more. 
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Figure  4.11:  Worst  Case  for  Approximation 


CHAPTER  4.  ONE,  TWO,  THREE,  APPROXIMATION 


Figure  4.12:  A  constant  borrowing  schedule 
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Hence, 

p=^^  =  1.25  +  0(l/n) 

opt 

So  that’s  as  well  as  this  approximation  scheme  does.  What  about  other  approximation 
algorithms?  Can  we  find  an  algorithm  that  will  approximate  to  within  a  constant?  This 
turns  out  not  to  be  possible  unless  we  could  solve  P  LT  itself. 

Lemma  4.13  If,  using  a  polynomial  amount  of  time,  we  could  approximate  the  schedule 
for  an  arbitrary  graph  such  that 

approx  —  opt  <  c 


then  PLT  €  P. 

Proof:  Take  an  instance  of  PLT.  Multiply  the  weights  by  c  +  I.  If  we  can  approximate 
the  new  problem  to  within  a  constant  c,  then  this  gives  us  an  optimum  for  the  original.  I 

What  about  approx  —  opt  <  log  n  or  approx  —  opt  < 

Assume  that  we  can  approximate  so  that  approx  —  opt  <  f{n)  for  some  function  /(n). 
We  will  multiply  the  weights  of  the  graph  by  another  function  g{n)  + 1.  If  /(n  •  (p(n)  + 1))  < 
g[n)  +  1  then  we  can  use  the  approximation  to  the  weighted  version  to  solve  the  original. 

Assume  that  /(n)  =  n*  for  some  c.  Let’s  choose  g{n)  =  -  1.  For  what  values  of 

€  can  we  find  a  6  such  that  /(n  •  (p(n)  +  1))  <  g{n)  +  1? 

We  want  So 


(1 + sy 

< 

(l  +  i)< 

(1  +  6)€^ 

< 

^6 

(1  +  ^)6 

< 

6 

e 

< 

{l-e)S 

6/(1  -  6) 

< 

6 

That  is,  for  every  constant  e  <  1,  there  is  a  constant  6  such  that  /(n  •  {g{n)  +  1))  < 
g{n)  +  1.  Hence, 

Lemma  4.14  Any  polynomial  time  approximation  algorithm  with  a  bound  of  approx  - 
opt  <  f[n)  =  n'  for  e  <  1  implies  a  polynomial  solution  to  PLT  itself.  | 


Chapter  5 


Lower  Bound  Techniques 


The  type  of  approximation  algorithms  that  we  were  just  considering  give  upper  bounds  on 
what  the  actual  overhang  is.  It  is  also  interesting  to  be  able  to  obtain  lower  bounds  on  the 
overhang. 


5.1  Simple  Methods 

A  few  lower  bound  techniques  are  easy  to  validate. 

Lemma  6.1  If  M  contains  a  bipartite  clique  Kc,d  as  a  subgraph  and  c,d  >  0,  then  the 
overhang  of  M  is  >  c  +  d—  IVI. 

Proof:  Even  if  the  clique  is  stuck  in  the  lower-left-hand  corner,  as  shown  in  Figure  5.1, 
its  upper-right  corner  will  protrude  enough  to  ensure  that  the  overhang  is  >  c  +  d  —  lyj. 

I 

A  clique  which  maximizes  c  +  d  can  be  found  in  polynomial  time  by  complementing 
the  edge  set  (E'  =  {(x,y)|  x  e  X,y  e  Y,{x,y)  i  E})  and  then  finding  the  maximum 
independent  set.  We  can  find  the  maximum  independent  set  in  polynomial  time  since  our 
graph  is  bipartite. 

Another  obvious  lower  bounding  technique  is  less  computationally  feasible. 

Lemma  5.2  If  every  subset  y'  C  Y  ofsizechas  |r(y')|  >  d,  then  the  overhang  is  >  d-c+1. 

Proof:  Let  Y'  be  the  first  c  nodes  released  by  a  schedule  and  let  X'  =  r(y').  Segregate 
X'  to  the  front  of  Sx-  Now,  Y'  and  X'  are  in  the  upper  left  corner  as  shown  in  Figure  5.2. 
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c+d-|Y| 


d 

Figure  5.1:  overhang  >  c  +  d  —  |y| 


There  must  be  an  X  in  each  column  of  the  sub-rectangle.  If  the  lower-right  corner  of  this 
sub- rectangle  is  occupied  by  an  edge,  then  the  overhang  is  >  d  —  c  -t-  1.  If  the  x  in  that 
column  where  in  any  other  row,  then  the  overhang  would  be  worse:  >  d  —  c  -1- 1.  Similarly, 
if  X'  isn’t  restricted  to  the  first  d  columns,  then  the  overhang  would  be>d-c-f-l.  I 

Corollary  5.3  If  every  subset  of  X  nodes  of  size  i  has  |r^(Jf)|  <  j,  then  the  overhang  is 
>  t  -  i  -f  1.  I 

This  is  a  more  general  version  of  a  previous  result: 

Lemma  5.4  [Too87]  Given  an  n  X  n  binary  matrix  M.  plt(M,  1)  =J>  there  is  no  2  <  f  <  n 
such  that  t  columns  of  the  matrix  each  have  >  n  -  t  -f  2  Is.  | 

As  an  example,  consider  the  graph  in  Figure  5.3.  Every  set  of  Y  nodes  of  size  2  is 
adjacent  to  5  X  nodes,  so  by  Lemma  5.2  the  overhang  >5  —  2-1-1  =  4.  Equivalently, 
every  set  of  5  X  nodes  releases  no  more  than  2  Y  nodes,  so  by  Corollary  5.3  the  overhang 
>5-24-1  =  4. 

This  technique  is  potentially  exponential,  and  is  not  guaranteed  to  yield  the  correct 
result.  For  example.  Figure  5.4  shows  a  graph  (X  nodes  are  filled  in  in  black)  with  overhang 
7  for  which  Lemma  5.2  can  only  show  a  lower  bound  of  6  (the  dashed  splines  enclose  a  Y 
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Figure  5.2:  overhang  >  d  —  c  +  1 


0  1  2  3  4  5  6 

0  X  •  X  X  • 

1  •  X  •  X  X  • 

2  •  •  X  •  X  X  • 

3  •  •  •  X  •  X  X 

4  X  •  •  •  X  •  X 

5  X  X  •  •  -x- 

6  •  X  X  •  •  •  X 


Figure  5.3:  3- regular  graph 
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that  minimizes  |r(y)|  over  aU  y  of  a  fixed  |yi).  To  see  that  it  has  overhang  7,  delete  the 
dashed  edge  and  the  two  nodes  dangling  from  it.  In  the  resulting  graph  every  set  of  7  T 
nodes  has  13  X  nodes  as  neighbors,  so  Lemma  5.2  tells  us  that  the  overhang  is  7.  Adding 
the  two  nodes  back  to  the  graph  can  not  reduce  the  overhang. 

5.2  Permanents  and  Perfect  Matchings 

Matchings  also  shed  some  light  on  the  overhang. 

When  a  maximum  matching  touches  every  node  in  the  graph,  we  say  that  it  is  a  perfect 
matching.  One  useful  function  of  an  adjacency  matrix  M  of  &  bipartite  graph  with  1X|  = 
|y|  =  n  is  the  permanent  of  the  matrix,  which  we  will  write  as  per(M ).  The  permanent  is 
equal  to  the  number  of  different  perfect  matchings  of  the  graph. 

Another  previous  result  is: 

Lemma  6.5  [Too87]  Given  an  n  x  n  binary  matrix  M .  plt(M,  1)  =>  the  permanent  of  M 
is  0  or  1.  I 

This  is  straightforward  once  it  is  stated.  Another  straightforward  result  is  that: 

Lemma  6.6  Given  an  n  x  n  binary  matrix  M.  The  permanent  of  M  is  1  =>  plt(M,  1). 

Proof!  Since  per(M)  =  1,  there  is  exactly  one  maximum  matching  of  the  graph.  As  we 
did  in  Section  2,  let’s  reverse  the  directions  of  the  edges  in  the  maximum  matching.  The 
resulting  graph  is  a  directed- acyclic  graph.  Give  the  Y  nodes  a  topological  numbering.  For 
each  edge  y  —*  x,  give  x  the  same  number  as  y.  If  we  schedule  the  X  nodes  and  the  Y 
nodes  in  the  order  of  this  numbering,  the  resulting  matrix  will  be  lower-triangular  with  Is 
down  the  diagonal.  | 

Furthermore,  Lemma  5.5  can  be  generalized  to: 

Lemma  5.7  If  the  n  X  n  matrix  M  has  overhang  A:  >  0,  then  per(Af)  <  A:”"* A:!. 

Proof:  The  question  here  is:  what  is  the  largest  possible  permanent  for  an  n  x  n  matrix 
with  overhang  kl  We  will  call  this  P(n,  At).  Clearly  adding  more  Is  to  the  matrix  can  only 
increase  the  permanent.  Hence  what  we  need  to  figure  out  is  the  permanent  of  the  matrix 
with  all  Is  below  the  A:-diagonal.  Figure  5.5  shows  the  case  where  n  =  6  and  k  —  3.  If 


Figure  5.4:  Lower  bo’ 
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0  1  2  3  4  5 

Oxxx- 
Ixxxx- 

2  X  X  X  X  X  • 

3  X  X  X  X  X  X 

4  X  X  X  X  X  X 

5  X  X  X  X  X  X 

Figure  5.5:  Matrix  for  P(6,3) 

k  =  n,  clearly  the  permanent  is  n!.  That  is,  P(n,  n)  =  n!.  If  A:  <  n,  then  look  at  the  column 
with  k  Is  in  it.  There  are  k  choices  for  which  row  to  match  to  that  column.  After  we  have 
made  that  choice,  the  remaining  matrix  is  an  (n  -  1)  X  (n  -  1)  matrix  with  overhang  k 
and  with  all  Is  below  the  A-diagonal.  That  is,  if  fc  <  n,  P{n,k)  =  k  ■  P{n  —  l,k).  Hence, 

Pin,k)  =  k^-^k\.  I 

The  non-monotonicity  of  the  implications  of  the  permanent  is  interesting;  per(M)  >  2 
‘no’;  per(M)  =  1  =►  ‘yes’;  per(M)  =  0  =;►  ‘no  information’. 

5.2.1  Subarrays 

Checking  if  the  permanent  =  0  or  =  1  is  easy.  However,  the  problem  of  computing  the 
permanent  is  #P-Complete  [Val79].  That  is,  it  is  hard  to  compute.  For  the  moment,  we 
will  ignore  this  and  see  what  light  the  permanent  sheds  on  our  problem. 

In  particular,  what  pesdcs  our  interest  are  the  following  simple  lemmas. 

Lemma  6.8  If  the  matrix  M  has  overhang  k{k>  1)  then  it  has  an  (n  -  A:  -|- 1)  x  (n  -  A:  -|- 1) 
subarray  M'  of  overhang  1.  I 

Lemma  5.9  If  the  matrix  M  has  a  n'  x  n'  subarray  M'  with  overhang  A:  (A:  >  0)  then 
plt(Af,n  -  n' -I- A:).  | 

Hence,  if  the  matrix  M  has  an  n'  x  n'  subarray  M'  with  per(Af')  =  1,  then  plt(M,n  - 
n'  +  1),  since  M'  has  overhang  1. 

By  Lemma  5.5  the  matrix  M'  in  Lemma  5.8  must  have  per(M')  =  0  or  per(M')  =  1.  In 
the  case  where  per(M')  =  0,  is  there  some  other  submatrix  M"  of  the  same  size  such  that 

per(M")  =  1? 
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0  12  3 

Oxx-- 
Ixx-- 

2  X  X  X  X 

3  X  X  X  X 

Figure  5.6:  Counterexample 

If  so,  then  there  always  exists  a  maximum  submatrix  with  per(M")  <  1,  which  actually 
has  per(M)  =  1.  The  following  lemma  almost  gives  us  what  we  want. 

Lemma  6.10  For  a  matrix  M  let  M'  be  a  largest  submatrix  with  per(Jl/')  <  1.  If 
per(M')  =  0,  then  overhang(M)  >  n-n'+l.  If  per(M')  =  1,  then  overhang(M)  =  n— n'+l. 

Proof:  Assume  to  the  contrary  that  M  has  overhang  k  <  n  —  n'  +  1.  Hence  there  is 
an  (n  —  A:  +  1)  X  (n  -  A:  +  1)  subarray  M"  of  overhang  1  (by  Lemma  5.8).  By  Lemma  5.5, 
M"  has  per(M'')  <  1.  But  by  assumption  k  <  n  —  n'  +  1  and  hence  n  —  A:  +  1  >  n'.  This 
contradicts  our  assumption  that  M'  was  the  largest  subarray  with  per(M^)  <  1.  Hence, 
overhang(M)  >  n  —  n'  +  1. 

Furthermore,  in  the  case  where  per(M^)  =  1,  we  know  that  overhang(M^)  =  1.  Then 
Lemma  5.9  teUs  us  that  M  has  overhang  <  n  —  n'  +  1.  Hence,  overhang{M)  =  n  —  n'  +  1. 

I 


But  unfortunately,  our  desired  lemma  is  not  true: 

False  Lemma  6.11  Let  M'  be  the  largest  subarray  of  M  with  per(Af')  =  1.  Then 
overhang(M)  =  n  —  n'  +  1.  | 

The  counterexample  in  Figure  5.6  has  numerous  2x2  subarrays  with  permanent  1,  no 
3x3  subarrays  with  permanent  1,  and  yet  has  an  overhang  of  only  2. 

Another  tempting  but  false  lemma  is: 

False  Lemma  6.12  Let  the  maximum  subarray  M'  with  per(M')  <  1  have  overhang  k. 
Then  the  array  M  has  overhang  n-n'  +  k.  I 

As  the  maximum  such  subarray  is  [j/o>yi>y2>y3)y4][®4i®5>®6)®7)®8])  the  lemma  would 
imply  that  the  array  in  Figure  5.7  has  overhang  7,  when  it  actually  has  overhang  6.  To 
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Xo  Xi 

I/O  X  X 

1/1  X  X 

2/2  X  X 

1/3  X  X 

1/4  X  X 

2/5  X  X 

2/6  •  X 

2/7  • 

2/8 


12  ®3  ®5  ®8 

XX . 

X  X  X  X  X  X 

X  X  X  X  X  X 

X  X  X  X  X  X 

X  X  X  X  X  X 

X  X  X  X  X  X  X 

X  X  X  X  X  X  X 

X  X  X  X  X  X  X 

X  X  X  X  X  X 


Figure  5.7:  Counterexample 


see  that  it  has  overhang  6,  simply  transpose  the  matrix;  that  is,  the  following  is  a  valid 
schedule: 


Sx  =  [x^,X7,Xs,Xs,X4,X3,X2,Xi,Xo] 

Sy  =  [0,0, 0,0, 0,0, 2/8, 2/7,  2/6, 1/5, 2/4,  2/3, 2/2, 2/1,  l/o] 


5.3  Network  Flow 

We  can  also  use  network  flow  techniques  to  obtain  lower  bounds  on  the  overhang  for  a 
graph. 

We  will  use  sets  of  source  nodes  and  sets  of  sink  nodes.  The  sources  and  sinks  will  either 
all  be  X  nodes  or  all  Y"  nodes.  We  will  set  the  capacities  of  the  nodes  and  the  edges  to  1. 
Thus  the  flow  is  actually  the  number  of  vertex  disjoint  paths  between  the  two  sets  of  nodes. 
Let’s  start  out  with  the  simplest  case.  Consider  the  graph  in  Figure  5.8. 
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Lemma  6.18  Given  a  unit-capacity  graph  and  a  single  source  node  yi  and  a  single  sink 
node  yj.  Let  the  maximum  flow  from  source  to  sink  be  k  and  assume  that  the  set  of  k 
disjoint  paths  uses  every  node  in  the  graph.  Then,  the  overhang  of  the  graph  is  >  k. 

Proof:  The  basic  idea  is  that  there  is  no  way  to  release  either  the  source  or  the  sink 
without  borrowing  k.  Hence  there  is  no  way  to  schedule  the  whole  graph  without  borrowing 
k. 

Let  the  first  source/sink  released  be  in  ATj.  Let’s  look  at  Sj  and  Tj_i.  Claim  nj-i  = 

To  see  this  claim,  we  partition  each  of  the  sets  Sj  and  Tj-i  into  k  pieces.  Since  the  graph 
has  a  flow  of  k  between  the  source  and  sink,  there  are  k  vertex  disjoint  paths  between  yi 
and  y2>  Number  these  paths  from  1  to  k.  Let  Ui  be  the  nodes  of  Sj  that  are  on  path  number 
i.  Let  Vi  be  the  nodes  of  Tj-\  that  are  on  path  number  i.  For  all  i,  |i/i|  >  |Vi|  -f  1.  To  see 
this,  look  at  two  cases.  If  |l^|  >  0,  then  Ui  must  contain  all  the  X  nodes  on  path  i  that  are 
adjacent  to  the  Y  nodes  in  Vi  and  hence  |l7i|  >  |Vi|  1.  If  |Vi|  =  0,  then  |17t|  >  |Vi|  4-  1, 
since  in  order  for  the  source/sink  to  be  released  in  AT,-,  we  must  have  \Ui\  >  1  for  all  t. 

Thus,  iT,-_ii  - 15,-|  =  i:?=i  m  -  E?=i  \Ui\  =  zLim  -  m)  <  e*=i  -i  =  -*• 

Hence,  overhang  >  k.  I 

The  cases  where  the  flow  actually  uses  up  the  whole  graph  will  be  rare.  If  the  flow 
doesn’t  use  all  the  nodes,  then  does  the  flow  still  give  us  some  information?  Yes. 

Lemma  5.14  Given  a  unit-capacity  graph  and  a  single  source  node  yi  and  a  single  sink 
node  y2.  Let  the  maximum  flow  from  source  to  sink  be  k.  There  may  be  several  ways 
to  achieve  a  /.-flow.  For  each  such  fc-flow,  there  will  be  some  Y  nodes  that  have  no  flow 
through  them.  If  delay(5)  <  k,  then  for  each  possible  fc-flow  from  yi  to  ya,  S  must  release 
some  Y  node  with  no  flow  through  it  strictly  before  5  releases  yi  or  ya. 

Proof:  Assume  to  the  contrary  that  there  is  some  schedule  5  with  delay(5)  <  k  and 
there  is  some  Jfc-flow  from  yi  to  ya  such  that  the  source  or  the  sink  is  released  before  a  Y  node 
with  zero  flow  is  released.  Let  the  first  source/sink  released  be  in  AT,.  The  equations  in 
Lemma  5.13  involving  (t/il  and  |T^|  still  hold,  and  |Tj_i|  —  |5j|  <  —k.  Thus  delay(5)/yegA:. 
Contradiction.  | 


Using  different  choices  for  sources  and  sinks,  this  lemma  gives  us  sets  of  conditions  on 
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^3 

Figure  5.10;  overhang  =  2 


allowable  schedules.  Along  with  other  constraints,  we  can  prove  lower  bounds  on  the  delays 
of  certsdn  graphs. 

Let’s  examine  the  example  in  Figure  5.9.  There  is  a  flow  of  3  from  yi  to  yj  that  uses 
all  Y  nodes  except  ya.  Lemma  5.14  tells  us  that  ys  must  be  released  strictly  before  either 
yi  or  y2  in  any  schedule  with  delay  <  3.  Similarly,  the  flow  of  3  from  yi  to  ya  that  uses 
all  Y  nodes  except  ya  tells  us  that  ya  must  be  released  strictly  before  either  yi  or  ya  in 
any  schedule  with  delay  <  3.  Since  these  two  conditions  are  contradictory,  no  delay  <  3 
schedule  is  possible,  and  overhang  >  3. 

The  example  in  Figure  5.10  is  different.  Here,  there  is  a  flow  of  3  from  yi  to  yj  that 
uses  all  Y  nodes  except  ya-  This  means  that  ya  must  be  released  strictly  before  either  yi 
or  ya  in  any  schedule  with  delay  <  3.  Similarly,  there  is  a  flow  of  3  from  yi  to  ya  that  uses 
all  Y  nodes  except  y^  and  this  tells  us  that  y^  must  be  released  strictly  before  either  yi  or 
ya  in  any  schedule  with  delay  <  3.  These  constraints  present  no  contradiction,  and  there  is 
indeed  a  schedule  with  a  delay  of  2. 

One  limitation  of  Lemmas  5.13  and  5.14  is  that  they  can  at  best  lower  bound  the 
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Figure  5.11:  Flow  of  4,  before  expanding  yi  and  3/2 


Figure  5.12:  After  expanding  yi  and  y2 


overhang  by  the  maximum  degree  of  the  graph.  Clearly  we  should  be  able  to  do  better: 
Suppose  we  start  with  the  graph  in  Figure  5.11,  which  has  a  flow  of  4,  and  then  we  apply 
the  Expanding  Transformation  to  it,  yielding  the  equivalent  graph  in  Figure  5.12,  with  a 
maximum  degree  3.  Now  the  flow  can  be  at  most  3.  It  would  be  nice  if  transforming  the 
graph  didn’t  diminish  our  ability  to  prove  a  lower  bound.  We  will  need  a  more  powerful 
lemma.  We  need  to  use  sets  of  sources  and  sinks  in  order  to  have  a  flow  of  4. 

We  will  also  need  to  have  ligament  sets.  A  ligament  set  will  be  a  set  of  X  nodes  that 
intuitively  ties  together  the  Y  nodes  of  the  sink  (or  source).  For  example,  in  Figure  5.13, 
{*4}  is  a  valid  ligament  set  for  the  sink  node  set  {j/4,  j/5},  and  {*1 ,  ®2, 0:3}  is  a  valid  ligament 
set  for  the  source  node  set  {3/1,  For  a  source/sink  node  set  Y',  the  ligament  set 

i(y')  C  X  must  satisfy  the  following  property; 

In  the  graph  induced  by  (Y’UL{Y')),  we  must  have  |r'(AOI  <  lA’'!  for  every 
X'  C  L{Y')  that  has  |A'|  <  |y'|  -  1.  That  is,  no  subset  X'  of  the  ligament 
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Figure  5.13:  Example  of  ligament  sets 

nodes  can  release  more  than  |.y^|  nodes  of  the  source/sink. 

One  consequence  of  this  property  is  that  \L{Y')\  >  \Y'\  -  1,  since  otherwise  X'  =  L{Y')  has 

\X'\  <  iri  - 1  and  |r'(xoi  =  ir'(i(y'))l  =  1^1  >  l^'l  + 1  >  l^'l- 

Note  that  the  source  and  sinks  sets  don’t  need  to  be  connected  sets;  for  example,  the 
source  and  sink  sets  shown  in  Figure  5.13  are  allowed. 

Lastly,  we  will  not  allow  the  flows  to  go  through  the  ligament  set.  One  way  to  visualize 
this  is  that  L{Y')  and  L{Y)  are  removed  from  the  graph  before  the  flow  is  computed. 

So  what  we  have  is  this: 

Lemma  5.16  We  have  a  source  node  set  Yi  and  a  sink  node  set  Y2  with  disjoint  ligaments 
sets  L{Yi)  and  L{Y2),  respectively.  Let  the  maximum  flow  in  G  \  (1(^1)  U  L{Y2))  be  k  and 
assume  that  the  set  of  k  disjoint  paths  uses  every  node  in  the  graph  G  \  (//(Fi)  U 
Then,  the  overhang  of  the  graph  G  is  >  k. 

Proof:  The  basic  idea  remains  similar.  There  is  no  way  to  release  either  all  the  nodes 
in  Yi  or  all  the  nodes  in  12  without  borrowing  k.  Hence  there  is  no  way  to  schedule  the 
whole  graph  without  borrowing  k. 

Assume  without  loss  of  generality  that  the  schedule  releases  all  of  the  nodes  of  the  source 
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before  it  releases  all  the  nodes  of  the  sink.  Let  the  the  last  node  of  the  source  be  released 
in  ATj.  As  before,  let’s  look  at  Sj  and  Tj_i.  Claim  nj^i  =  ITj-il  —  |Sj|  <  —k. 

To  see  this  claim,  we  partition  each  of  the  sets  Sj  and  Tj-i  into  /t  +  2  pieces.  Since  the 
graph  has  a  flow  of  A:  between  the  source  and  sink,  there  are  k  vertex  disjoint  paths  between 
j/i  and  2/2-  Number  these  paths  from  1  to  k.  Let  Ui  be  the  nodes  of  Sj  that  are  on  path 
number  i.  Let  Vi  be  the  nodes  of  Tj-i  that  are  on  path  number  t.  As  before,  for  1  <  t  <  fc, 

\Ui\  >  \Vi\  +  1. 

We  let  Uk+i  be  the  nodes  of  Sj  that  are  in  L{Y\).  We  let  Vk+\  be  the  nodes  of  Tj-\ 
that  are  in  Yi.  We  let  £/*+2  be  the  nodes  of  Sj  that  are  in  1(^2)-  We  let  1^+2  be  the  nodes 
of  Tj-i  that  are  in  ^2. 

We  claim  that  |Vlfc+i|  <  \Uk+i\-  In  verifying  this,  there  are  two  cases.  If|17*;+i|  > 
then  since  |Vfc+i|  <  |li|,  we  see  that  |Vfc+i|  <  If  |£^fc+i|  <  l^il  —  1»  then  the 

deflnition  of  ligament  sets  tells  us  that  in  the  graph  induced  by  (Yi  U  L{Yi)),  we  have 
ir'(fffc+i)|  <  |C^fc+i|;  since  Vjt+i  C  r'(£^fc+i),  then  |V)fe+i|  <  |I^fc+i|.  Similarly  we  can  see  that 

|Vfc+2|  <  \Uk+2\. 

Thus,  ir,_xi  -  \Sj\  =  iv;i  -  m  =  -i  =  -*• 

Hence,  overhang  >  k.  | 

Unfortunately,  unlike  with  Lemma  5.13,  there  is  no  clear  polynomial  time  algorithm  to 
And  the  source  and  sink  sets  that  yield  the  largest  k. 

As  with  the  simpler  source  and  sink  sets,  it  is  very  useful  to  have  a  lemma  that  handles 
the  case  when  the  flows  don’t  use  all  the  nodes: 


Lemma  5.16  We  have  a  source  node  set  Yj  and  a  sink  node  set  Y2  with  disjoint  ligaments 
sets  L{Yi)  and  L{Y2),  respectively.  Let  the  maximum  flow  in  (7  \  (L(Yi)  U  L{Y2))  be  k.  If 
delay(5')  <  k,  then  for  each  possible  fc-flow  from  Yi  to  l2>  S  must  release  some  Y  node  with 
no  flow  through  it  strictly  before  5  releases  either  all  the  nodes  of  I'l  or  aU  the  nodes  of  Yi- 

I 


For  example,  in  Figure  5.14  the  directed  arcs  show  a  flow  of  7  from  the  sinks  to  the 
sources.  Hence,  for  a  schedule  to  achieve  a  delay  less  than  7,  it  must  release  yi  or  2/2  before 
it  releases  all  the  nodes  of  either  the  source  or  sink. 
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Figure  5.14:  Flow  of  7  that  doesn’t  use  yi  and 


5.4  Minimal  Graphs 

Now  that  we  have  some  lower  bound  results,  we  can  prove  that  some  sets  of  graphs  are 
minimal  graphs.  That  is,  removing  any  edge  from  the  graph  reduces  its  overhang.  One 
interesting  reason  to  study  minimal  graphs  is  the  possibility  of  finding  a  polynomial  algo¬ 
rithm  to  verify  them.  If  there  were  a  polynomial  algorithm  that  could  take  a  graph  and 
verify  that  it  was  a  minimal  graph  of  overhang  k,  then  the  PLT  problem  would  be  in  coNP 
and  hence  in  coNP  n  NP. 

5.4.1  Simple  Minimals 

One  simple  class  of  minimal  graphs  is  suggested  by  Lemma  5.13.  We  take  two  Y  nodes  and 
string  k  disjoint  paths  between  them  as  in  Figure  5.15. 

Before  we  prove  that  these  are  minimal,  let’s  define  a  minimal  ligament  set.  In  graph 
G,  for  a  set  V,  we  will  call  a  ligament  set  L{Y)  minimal  if  (1)  L{Y)  C  1(7)  and  (2)  after 
removing  any  edge  between  Y  and  L{Y),  the  set  L{Y)  no  longer  satisfies  the  ligament  set 
property  with  respect  to  7. 
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Figure  5.15:  Minimal  graph  with  overhang  =  3 


Lemma  6.17  Given  a  graph  with  a  flow  of  k  between  disjoint  source  and  sink  sets  such 
that  (a)  the  ligament  sets  of  the  source  and  sink  are  minimal  ligament  sets  and  (b)  every 
edge  that  is  not  in  the  ligament  is  used  by  the  flow.  Such  a  graph  is  a  minimal  graph  with 
overhang  k. 

Proof:  We  already  know  that  overhang  >  k.  We  will  show  that  deleting  any  edge 
e  =  (®,y)  leaves  overhang  =  A;  —  1.  There  are  two  cases. 

If  the  deleted  edge  e  is  a  flow  edge:  (Follow  along  in  Figure  5.16.)  In  the  graph  G\e, 
there  is  a  unique  path  from  y  to  a  source/sink  node.  Schedule  the  X  nodes  along  this 
path  in  order  (ii).  Schedule  all  remaining  X  nodes  adjacent  to  that  source/sink  node 
(i2, 13).  Iterate  through  each  remaining  Y  node  that  is  in  the  same  source/sink  node 
set,  scheduling  all  remaining  X  nodes  adjacent  to  the  Y  node  (X4,  15).  Then  repeat 
until  done:  for  each  Y  node  of  degree  1,  schedule  its  adjacent  X  node  (xe,  X7,  xs,  xg, 

®io>  ®ii>  ®12>  *13)- 

If  the  deleted  edge  c  is  between  a  ligament  node  and  a  source/sink  node: 

(Follow  along  in  Figure  5.17.)  Removing  the  edge  breaks  the  ligament  set  into  two 
pieces.  For  the  piece  that  contains  y,  iterate  through  each  remaining  Y  node,  schedul¬ 
ing  all  remaining  X  nodes  adjacent  to  the  Y  node  (xj,  X2).  Then  as  long  as  possible, 
for  each  Y  node  of  degree  1,  schedule  its  adjacent  X  node  (xg,  X4,  xg,  xe,  i?)-  If 
anything  remains,  it  includes  the  other  source/sink  node  set.  Iterate  through  each 
remaining  Y  node  that  is  in  that  source/sink  node  set,  scheduling  ail  remaining  X 
nodes  adjacent  to  the  Y  node  (xg,  xg,  xio)-  Then  repeat  until  done:  for  each  Y  node 
of  degree  1,  schedule  its  adjacent  X  node  (xn,  X12,  iia)- 


I 


Figure  5.16:  deleting  a  flow  edge 


Figure  5.17:  deleting  a  ligament  edge 
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So  besides  proving  that  Figures  5.15  is  minimal  with  overhang  3,  this  shows  that  Fig¬ 
ure  5.13  is  minimal  with  overhang  6.  Note  that  while  we  already  knew  that  these  were  the 
overhangs  of  the  graphs,  we  did  not  know  that  the  graphs  were  minimal,  since  the  following 
lemma  is  false. 

False  Lemma  6.18  Let  G'  be  a  graph  derived  from  G  by  applying  the  Collapsing  Trans¬ 
formation.  If  G'  is  minimal  for  overhang  k,  then  so  is  G.  | 

A  simple  counterexample  can  be  seen  in  Figure  4.2.  Both  graphs  have  overhang  1.  The 
graph  on  the  right  is  minimal,  but  deleting  the  x\  —*  y  edge  or  the  X2  y  edge  does  not 
reduce  the  overhang  of  the  graph  on  the  left. 

5.4.2  Disconnected  Minimals 

These  simple  minimals  will  always  be  connected  graphs.  It  turns  out  that  under  the  right 
conditions  a  set  of  unconnected  minimals  will  be  a  minimal  graph. 

But,  we  must  build  up  some  tools,  first. 

Lemma  6.19  A  connected  minimal  graph  is  either  an  ascent  piece,  a  descent  piece  or  a 
plateau  piece. 

Proof:  Assume  to  the  contrary  that  the  graph  has  two  or  more  such  pieces.  Since  the 
graph  is  connected,  there  is  an  edge  between  two  of  the  pieces.  If  this  edge  is  removed, 
the  pieces  remain  unchanged  and  their  relative  ordering  is  unchanged.  That  is,  the  well- 
ordered  optimum  schedule  and  hence  the  overhang  remains  unchanged.  Thus  the  graph  is 
not  minimal.  | 

Lemma  6.20  For  a  connected  minimal  graph  G  that  is  an  ascent  with  overhang  =  A;,  all 
MMBNPSs  of  G  have  borrowing  =  -k. 

Proof:  Assume  to  the  contrary  that  the  first  MMBNPS  piece,  5  =  [xi,...,®*]  is  a 
(6,p)  piece  and  that  it  has  borrowing  b  >  —k.  So  the  situation  is  that  shown  on  the  left 
in  Figure  5.18,  where  the  initial  MMBNPS  5  does  not  dip  down  the  furthest.  Hence, 
G  \  (5  U  r'(5))  must  have  overhang  =  k  +  p. 

Let’s  find  a  maximum  matching  in  the  graph  induced  by  (5  U  r'(5)).  It  will  be  of  size 
2.  In  G,  other  than  these  z  matching  edges,  delete  every  other  arc  from  nodes  in  the  set 
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Figure  5.18:  Charts  for  proof  of  Lemma  5.20 


S.  This  will  give  us  a  modified  graph  G'.  G'  will  now  be  G  \  (5  U  r'(5))  plus  |r'(5)|  —  |5| 
isolated  Y  nodes  and  |51  components  each  consisting  of  an  X  node,  a  Y  node  and  an  edge 
between  them — (-1,0)  pieces.  The  chart  for  this  will  now  look  like  the  chart  on  the  right 
of  Figure  5.18.  G'  still  has  overhang  =  k,  and  hence  G  could  not  have  been  minimal.  | 


Lemma  6.21  Let  G  be  a  graph  which  consists  of  many  separate  connected  minimal  sub¬ 
graphs  that  are  ascent  subgraphs,  and  which  each  have  overhang  =  k.  Then  G  is  minimal 
and  has  overhang  =  k. 

Proof:  From  our  merging  algorithm  and  Lemma  5.20,  we  know  that  the  overhang  of  the 
combined  graph  is  k.  If  we  delete  any  edge,  then  by  scheduling  the  subgraph  that  contained 
that  edge  before  the  other  subgraphs,  we  can  find  a  schedule  of  overhang  =  k  -  1.  | 

By  Transposition: 

Corollary  5.22  Let  G  be  a  graph  which  consists  of  m  minimal  graphs  (Gi , . . . ,  Gm)  which 
are  descent  subgraphs,  and  which  each  have  overhang  =  k  and  |Xj|  —  |li|  =  j.  Then  G  is 
minimal  and  has  overhang  =  m{k  —  j)  +  j  and  |X|  —  |y|  =  mj.  | 

But  these  two  flavors  of  minimal  graphs  don’t  mix: 

Lemma  5.23  For  a  minimal  graph  G  with  overhang  =  k,  there  cannot  be  edges  in  both 
ascent  and  descent  components. 
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Figure  5.19:  subgraphs  that  are  minimal 


Proof:  Assume  that  the  chart  dips  to  a  level  of  -k  during  either  the  plateau  or  the 
descent.  Then,  similarly  to  5.20,  we  delete  all  edge  in  the  ascent  except  for  a  maximum 
matching  within  the  ascent.  In  this  modified  graph,  the  matching  edges  from  the  ascent 
must  now  be  in  the  plateau.  The  modified  graph  still  has  overhang  =  k. 

Hence,  if  the  chart  of  a  minimal  graph  dips  to  —k  during  either  the  plateau  or  descent, 
then  there  are  no  edges  in  the  ascent.  By  transposition,  if  the  chart  of  a  minimal  graph 
dips  to  —k  during  either  the  plateau  or  ascent,  then  there  are  no  edges  in  the  descent. 
Since  the  must  dip  to  —k  somewhere,  then  there  either  are  no  edges  in  the  ascent  or  no 
edges  in  the  descent.  I 

But,  we  can  pepper  either  flavor  of  minimal  graph  with  any  number  of  isolated  edges: 

Lemma  6,24  Let  G  be  a  minimal  graph  which  has  overhang  =  k.  Add  to  G  two  new 
nodes  x  and  y  and  a  single  edge  between  them  (x,y),  yielding  G'.  Then  G'  is  minimal  and 
has  overhang  =  k. 

Proof:  If  the  edge  deleted  is  (®,  y),  then  y  is  released  immediately,  and  by  scheduling  x 
last,  we  will  have  reduced  the  overhang  by  1,  If  the  edge  deleted  is  not  (x,y),  then  schedule 
X  and  y  last,  and  we  wiU  still  have  reduced  the  overhang  by  1.  | 

So  Figure  5.19  shows  two  diflFerent  minimal  subgraphs  of  the  same  graph. 
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Games 


6.1  A  Game  Theory  Problem 

The  triangularization  problem  can  also  be  framed  as  a  two  person  game  in  the  Conway 
sense  [BCG82,  Con76]. 

Consider  the  following  game:  There  is  a  directed  graph  with  Blue  nodes  and  Red  nodes. 
Blue  and  Red  alternate  turns.  On  a  player’s  turn  he  must  remove  from  the  graph  one  node 
of  his  color  that  has  indegree  0;  all  edges  adjacent  to  the  node  are  also  deleted.  The  first 
player  to  have  no  legal  moves  on  his  turn  is  the  winner. 

Our  triangularization  problem  PLTi  corresponds  to  the  special  case  of  this  game  in 
which  all  of  the  directed  arcs  go  from  Blue  nodes  to  Red  nodes  and  Blue  goes  first.  Can 
Blue  win?  What  is  his  winning  strategy?  Note  that  in  this  special  case,  Red’s  choices  are 
irrelevant  to  the  outcome  of  the  game. 

Thus  the  question  immediately  arises:  What  use  can  we  make  of  the  theory  of  games 
that  Conway  et  al  develop?  In  Conway’s  theory  he  develops  an  algebra  of  position  values. 
That  is,  each  game  position  is  assigned  a  value  which  has  three  properties: 

1.  ADD:  The  value  of  the  sum  of  two  games  can  be  computed  from  the  values  of  the  two 
games. 

2.  AUGMENT:  The  value  of  a  position  can  be  computed  from  the  values  of  its  children 
positions. 

3.  WINNER:  Given  the  value  of  a  position,  one  can  determine  who  would  win  the  game 
if  Blue  started  and  who  would  win  the  game  if  Red  started. 
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6.2  Misere  Play 

Unfortunately,  his  algebra  is  intended  for  normal  play  games.  That  is,  games  where  the 
player  who  is  stuck  loses,  in  contrast  to  misere  play  where  the  player  who  moves  last  loses. 
The  theory  does  not  extend  to  misere  play.  In  fact,  no  general  value  scheme  for  misere  play 
is  known  —  you  need  to  essentially  encode  the  entire  position  in  your  value. 

Our  game  is  a  misere  play  game.  Yet  we  can  devise  an  algebra  that  satisfies  all  three 
properties  and  doesn’t  encode  the  entire  problem  in  it.  That  is,  it  is  very-many-to-1. 

For  the  value  of  a  position  we  will  just  use  the  fundamental  chart  of  any  optimal  schedule 
of  the  corresponding  graph. 

There  is  an  ADD  procedure  that  is  used  in  adding  two  positions:  ADDing  two  indepen¬ 
dent  games  is  the  same  thing  that  we  earlier  called  merging  two  fundamental  charts. 

There  is  an  AUGMENT  procedure,  which  only  needs  to  look  at  Blue’s  possible  moves: 
How  do  we  compute  the  value  of  a  position  given  the  values  of  its  children?  Let  , . . .  wj,  be 
the  values  of  the  children  position  for  each  of  Blue’s  valid  moves.  We  can  easily  compute 
vj , . . . ,  Wfc,  the  values  of  the  initial  position,  assuming  that  move  t  is  Blue’s  best  move.  Then 
we  just  need  to  pick  out  the  best  one  of  the  v^.  We  already  showed  how  to  pick  out  the  best 
chart  from  a  set  of  charts  when  we  were  solving  weighted  chains  in  Section  3.4. 

How  do  we  determine  the  WINNER  from  the  value? 

If  it  is  Blue’s  turn:  If  the  fundamental  chart  dips  below  —1,  then  Red  (Y)  was  stuck  at 
some  point,  and  Blue  (X)  loses.  If  the  chart  never  dips  below  -1,  and  ends  at  -1, 
then  Red  is  stuck  and  Blue  loses.  Otherwise,  Blue  will  be  stuck  first,  and  so  Blue 
wins. 

If  it  is  Red’s  turn:  If  the  fundamental  chart  is  ever  below  1,  then  Red  (Y)  was  stuck  at 
some  point,  and  Blue  (X)  loses.  Otherwise,  Blue  will  be  stuck  first,  and  so  Blue  wins. 

So  our  value  satisfies  all  three  properties. 

6.2.1  A  Misere  Hackenbush  variant 

Obviously  our  algebra  can’t  be  extended  to  cover  arbitrary  misere  games.  However,  there 
is  one  form  of  misere  Hackenbush  that  it  can  handle:  misere  childish  red-blue  hackenbush 
with  stalks  that  have  blue  stems  and  red  tips.  That  is,  starting  from  the  ground,  the  edges 
are  blue.  At  some  point  they  turn  red  and  stay  red,  all  the  way  to  the  tips. 
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6.3  Normal  Play 

When  we  play  in  a  non-Misere  fashion,  we  can  make  full  use  of  the  system  of  assigning 
values  to  games  developed  by  Conway. 

Definition  6.1  For  a  bipartite  graph  G  =  (X,Y,E),  &  Y -dominating  set  is  a  set  X'  C  X 
such  that  for  any  y  EY  there  is  some  x  £  X'  such  that  {x,y)  6  E,  i.e.  r(X')  =  Y . 

Lemma  6.1  If  G  is  such  that  X  ie  a  minimal  y.dominating  set  for  Y,  then  the  value  of  G 
is  0. 

Proof:  If  it  is  Y's  turn  to  move,  he  has  no  legal  moves  and  he  loses.  If  it  is  X’s  turn 
to  move,  then  each  X  node  that  he  removes  wiU  release  at  least  one  Y  node,  and  hence  he 
will  ultimately  lose.  A  game  in  which  whoever  starts  is  the  loser  is  a  0  game.  | 

Lemma  6.2  If  G  consists  of  an  isolated  Y  node,  then  G  has  the  value  —1. 

Proof:  If  it  is  Y'b  turn,  his  move  must  leave  an  empty  graph.  An  empty  graph  is  a 
special  case  of  Lemma  6.1  and  thus  has  value  0.  If  it  is  X’s  turn,  he  has  no  legal  move.  So 
the  value  of  G  is  {  |  0}  =  —  1.  I 

Lemma  6.3  If  G  has  a  minimum  y-dominating  set  of  size  d  >  1,  then  the  value  of  the 
game  is  |X|  —  d. 

Proof:  We  wiU  use  induction  on  |X|.  The  case  when  |X|  =  1  and  hence  d  =  1  is  covered 
by  Lemma  6.1. 

The  case  when  —  d  =  0  is  also  covered  by  Lemma  6.1,  so  we  only  need  to  consider 
the  general  case  where  |Jf|  —  d  >  0.  Once  again,  since  there  exists  a  y -dominating  set, 
player  Y  has  no  legal  moves.  But  X  has  three  types  of  moves;  moves  that  leave  a  minimum 
y-dominating  set  of  size  d;  moves  that  release  a  Y  node,  thus  reducing  the  size  of  the 
minimum  y-dominating  set  to  d  —  1;  and  moves  that  don’t  do  either. 

Since  jXl  -  d  >  0,  moves  of  the  first  type  always  exist;  these  moves  are  when  we  remove 
some  X  node  that  is  not  in  all  of  the  minimum  y-dominating  sets  of  size  d.  They  leave  a 
graph  G'  with  |X'|  =  |X|  -  1  and  [X'l  -  d=lX|-d-l  and  hence,  by  induction,  with  value 
|X|-d  -  1. 
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Moves  of  the  third  type  are  when  we  remove  a  node  that  is  in  all  of  the  minimum  Y- 
dominating  sets  of  size  d,  and  yet  it  doesn’t  release  a  Y  node.  Hence,  it  leaves  a  graph  G' 
with  d'  >  d  and  |X'|  -  d'  <  |X|  -  d  -  1  and  hence,  by  induction,  with  value  <  |X|  -  d  -  1. 
This  type  of  move  is  never  desirable  for  X. 

After  a  move  of  the  second  type,  the  graph  G'  has  two  parts.  It  has  j  >  1  isolated 
Y  nodes  and  a  piece  with  no  isolated  Y  nodes,  which  we  will  call  G".  It  is  the  case  that 
d"  =  d-1  and  |X"|  =  IXl  — 1.  So,  by  induction  on  IXl,  the  value  of  (?"  is  \X"\-d"  =  |X|  — d. 
The  value  of  G'  is  |X|  —  d  —  j  <  |Xl  -d-1. 

Since  X  can  always  leave  a  game  of  value  v  —  |X|  —  |d|  —  1  and  never  leave  a  game  of 

larger  value,  the  overall  value  of  the  game  G  is  {u  |  }  =  |X|  —  d.  | 

So,  we  can  find  the  value  of  a  game  iff  we  can  find  its  minimum  y-dominating  set. 
Unfortunately,  this  turns  out  to  be  iVP-Complete. 

y-DOM-SET 

INSTANCE:  A  bipartite  graph  G  =  (X.y,  E)  and  a  bound  k. 

QUESTION:  Is  there  a  set  X'  C  X  such  that  \X'\  <  k  and  r(A’')  =  Y1 

Lemma  6.4  y-DOM-SET  is  AP-Complete  —  even  if  all  elements  of  Y  have  degree  <  2. 

Proof:  Reduction  from  minimum  vertex  cover  set.  Given  an  instance  of  vertex  cover 
set,  G=(V,E)  and  a  bound  k.  Is  there  a  set  V'  CV  such  that  |F'|  <  k  and  for  every  edge 
e  =  (vj,w,)  G  E,  either  Vi  G  V’  or  Vj  G  V'l 

We  convert  this  into  an  instance  of  T-dom-set  G'  =  (X',Y',  E')  as  follows:  For  every 
vertex  Vi  &  V  create  a  node  n  ^  X.  For  every  edge  ei  E  E  create  a  node  yi  E  Y,  If 
vertex  Vi  is  an  endpoint  of  edge  Cj,  then  put  an  edge  in  E'  between  Xi  and  yj.  Is  there  a 
y-dominating  set  of  size  <kl  | 
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We  have  taken  the  triangularization  problem,  recast  it  in  a  number  of  different  forms,  and 
twisted  it  this  way  and  that  holding  it  up  to  a  variety  of  light  sources.  What  have  we 
learned  from  this? 

We  have  seen  that  the  triangularization  problem  is  very  closely  related  to  a  number  of 
scheduling  problems  that  demand  to  be  solved  as  computer  engineers  try  to  leave  less  of 
their  available  hardware  idle.  It  is  easy  to  believe  that  some  of  the  tools  we  have  honed  in 
this  thesis  will  prove  useful  in  dealing  with  those  scheduling  problems. 

We  have  seen  a  number  of  iVP-complete  problems  that  are  very  close  to  the  triangular¬ 
ization  problem.  Perhaps  someone  else  can  close  the  gap  and  show  that  triangularization 
is  JVP-complete.  Such  a  proof  would  probably  require  some  very  interesting  methods. 

We  have  developed  a  tool  for  segregating  schedules.  The  first  thing  that  this  allowed 
us  to  do  is  break  up  schedules  into  a  number  of  independent  sub-schedules.  Ultimately, 
this  led  us  to  well-ordered  optimum  schedules,  fundamental  charts,  and  nearly  well-ordered 
schedules. 

In  fundamental  charts,  we  have  found  a  very  simple  description  of  a  graph.  It  is  a 
description  that  abstracts  away  all  the  unnecessary  and  confusing  chaos  of  the  wild  crossing 
of  edges  from  node  to  node.  It  is  also  a  description  that  allows  us  to  merge  solutions  for 
independent  graphs  in  polynomial  time,  without  reconsidering  the  interconnections  of  edges 
and  nodes. 

Besides  being  fascinating  in  their  own  right,  these  properties  form  the  underpinnings 
of  our  polynomial  algorithm  to  schedule  weighted  chains.  We  have  also  shown  that  graphs 
with  all  Y  nodes  of  degree  <  2,  graphs  with  all  X  nodes  of  degree  <  2,  and  graphs  that 
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are  trees  are  aU  solvable  in  polynomial  time.  However,  more  interesting  than  any  of  these 
is  the  polynomial  algorithm  for  solving  PLATEAU  pieces  with  overhang  <  k. 

The  next  hardest  problem  to  try  to  get  a  polynomial  algorithm  for  is  graph  that  are 
weighted  trees.  We  have  seen  that  both  weighted  chains  and  unweighted  trees  can  be  solved 
in  polynomial  time.  Even  a  solution  for  weighted  binary  trees  would  be  very  interesting. 

While  our  approximation  results  seem  discouraging,  it  should  be  remembered  that  these 
dealt  with  worst-case  behavior  of  approximation  algorithms.  Practical  approximation  al¬ 
gorithms  should  be  pursued,  but  this  can  only  be  done  effectively  with  a  source  of  real 
instances. 

Lastly,  better  lower  bound  techniques  need  to  be  pursued.  For  example,  is  there  a  family 
of  graphs  with  all  Y  vertices  of  degree  <  3  whose  overhang  is  n(n)?  (We  have  proved 
Lemma  4.12  that  the  overhang  of  such  problems  is  at  most  about  n/4.)  Any  progress  here 
will  aid  our  understanding  of  the  triangularization  problem. 

Many  aspects  of  this  problem  remain  elusive.  Something  interesting  is  going  on  here. 
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